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Abstract 

We pursue the task of developing a finite population counterpart 
to Eigen's model. We consider the classical Wright-Fisher model 
describing the evolution of a population of size m of chromosomes of 
length I over an alphabet of cardinality k. The mutation probability 
per locus is q. The replication rate is a > 1 for the master sequence 
and 1 for the other sequences. We study the equilibrium distribution 
of the process in the regime where 

I —> +oo , m — > +oo , q — > , 
iq a e]0,+oo[, — — > a £ [0, +oo] . 

We obtain an equation a%j){a) = Inn in the parameter space (a, a) 
separating the regime where the equilibrium population is totally 
random from the regime where a quasispecies is formed. We observe 
the existence of a critical population size necessary for a quasispecies 
to emerge and we recover the finite population counterpart of the er- 
ror threshold. The result is the twin brother of the corresponding 
result for the Moran model. The proof is more complex and it relies 
on the Freidlin-Wentzell theory of random perturbations of dynam- 
ical systems. 
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1 Introduction. 



The Wright-Fisher model is one of the most studied model in mathematical 
population genetics. In this work, we apply to a basic Wright-Fisher model 
the ideas presented in [3J for the Moran model, thereby pursuing the task 
of developing a finite population counterpart to Eigen's model. Numerous 
works have attacked this issue [TJ [5J [TUJ US HI] • Using different tech- 
niques, Deem, Hu and Saakian [53], Deem, Muiioz and Park [52], Musso 
[19] and Dixit, Srivastava, Vishnoi [6] considered finite population models 
which approximate Eigen's model when the population size goes to infinity. 
These models are variants or generalizations of the classical Wright-Fisher 
model of population genetics. The problem is to understand how the error 
threshold phenomenon present in Eigen's model in the infinite population 
limit shows up in the finite population model. We refer to the introduction 
of [3j for a detailed discussion of this question and the heuristics guiding 
our strategy. We consider here the classical Wright -Fisher model describ- 
ing the evolution of a population of size m of chromosomes of length I over 
an alphabet of cardinality k. The mutation probability per locus is q. The 
replication rate is a > 1 for the master sequence and I for the other se- 
quences. We study the equilibrium distribution of the process in the regime 
where 

I — > +oo , m — > +oo , q — > , 
vn 

tq -t a e]0,+oo[, — ->• a 6 [0, +oo] . 

We obtain an equation a ip(a) = In k in the parameter space (a, a) separat- 
ing the regime where the equilibrium population is totally random from the 
regime where a quasispecies is formed. We observe the existence of a crit- 
ical population size necessary for a quasispecies to emerge and we recover 
the finite population counterpart of the error threshold. It is a classical 
fact that the Moran model and the Wright-Fisher model have similar dy- 
namics. Indeed, the main result here is the twin brother of the main result 
of [3J, the only difference being the equation of the critical curve. While 
we could compute exactly the critical curve for the Moran model, here the 
critical curve is defined through a variational problem depending on the 
parameter a. Apart from this point, the scaling and the associated expo- 
nents are the same in both cases. This confirms a conjecture of [3J and it 
sustains the hope that this kind of analysis is robust. 

A potential application of the result concerns genetic algorithms. In- 
deed, the Wright-Fisher model is identical to the genetic algorithm without 
crossover. In her PhD thesis [21], Ochoa investigated the role of the er- 
ror threshold phenomenon for genetic algorithms and she concluded that 
there exists a relationship between the optimal mutation rate and the er- 
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ror threshold. The result proved here provides a theoretical basis for some 
heuristics to control efficiently the genetic algorithms proposed in [2]- 

On the technical side, the Wright-Fisher model is much more difficult 
to handle than the Moran model. In the Moran model, the estimates of 
the selection drift relied on a birth and death model introduced by Nowak 
and Schuster . In the Wright-Fisher model, the bounding processes are 
more complicated, they involve three dependent binomial laws. As the size 
of the population grows, their transition probabilities satisfy a large devia- 
tion principle, derived with the help of the classical Cramer theorem. In the 
set of the populations containing the master sequence, the process can be 
seen as the random perturbation of a discrete dynamical system. This dis- 
crete dynamical system is simply the sequence of the iterates of a rational 
map F : [0, 1] — > [0, 1]. Depending on the parameters, this map has either 
one stable fixed point or two fixed points, one stable and the other unsta- 
ble. This opens the way to the application of the general scheme developed 
by Freidlin and Wentzell [5] to study the random perturbations of dynam- 
ical systems. Originally, Freidlin and Wentzell studied diffusion processes 
arising as brownian perturbations of a differential equation. These pro- 
cesses are continuous time Markov processes with a continuous state space. 
However their approach is robust and it can be applied in other contexts. 
Kifer [T31 H3] reworked this theory in the discrete time case. Unfortunately, 
our bounding processes do not fit the hypothesis of Kifer's model, for the 
following two reasons. In Kifer's model, the large deviation rate function 
of the transition probabilities is not allowed to be infinite, and the large 
deviation principle for the transition probabilities is assumed to be uni- 
form with respect to the starting point. Certainly the general framework 
considered by Kifer could be adjusted to include our case, with the help 
of some relaxed hypothesis. Yet in our case, we have only two attractors, 
one unstable and one stable, and we need only two specific estimates from 
the general theory, which is concerned with a finite number of attractors 
of any type. In fact, the kind of estimates we need have been computed in 
two other works handling closely related models. In an unpublished work 
[U (transmitted to me by courtesy of Gregory Morrow), Darden analyzed 
a Wright-Fisher model with two alleles and no mutation with the help of 
the Freidlin- Wentzell theory. What we have to do essentially is to obtain 
results analogous to Darden for the model with mutations. Morrow and 
Sawyer j!8) considered a more general model of Markov chains evolving in 
a convex subset of M. d around one stable attractor. Our bounding processes 
would fit this framework, were it for the uniform assumption on the vari- 
ance of the transition probabilities. In our case, this condition is violated 
close to the unstable attractor 0. We can apply their results outside a 
neighborhood of 0, but this would lead to a messy construction. It appears 
that, in any case, if we try to apply the results of Kifer or of Morrow and 
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Sawyer, we have to make a specific study of our process in the vicinity 
of the unstable fixed point 0. In the end, it seems that the most efficient 
presentation consists in deriving from scratch the required estimates, fol- 
lowing the initial ideas of Frcidlin and Wentzell. The techniques involved 
in the proof are classical and go back to the seminal work of Frcidlin and 
Wentzell. However there is an important simplifying feature in our case. 
Indeed, the bounding processes are monotone. This allows to avoid uniform 
large deviation estimates and to provide substantially simpler proofs. 

We describe the model in the next section and we present the main 
result in section [3J The rest of the paper is devoted to the proofs. The 
global strategy is identical to the case of the Moran model. The lumping 
is performed in section |4j In section we build a coupling and we prove 
the monotonicity of the occupancy process. This allows us to define simple 
bounding processes in section [SJ Section [7] which analyzes the dynamics 
of the bounding processes is much more complicated than for the Moran 
model. Section[5]is slightly simpler than for the Moran model. Some results 
for the mutation dynamics from [3J are restated without proof, otherwise 
the estimates in the neutral case are adapted easily to the case of the 
Wright -Fisher model. 

2 The Wright— Fisher model. 

Let A be a finite alphabet and let k = card .4 be its cardinality. Let 
( > 1 be an integer. We consider the space A e of sequences of length I 
over the alphabet A. Elements of this space represent the chromosome of 
an haploid individual, or equivalently its genotype. In our model, all the 
genes have the same set of alleles and each letter of the alphabet A is a 
possible allele. Typical examples are A= { A, T, G, C } to model standard 
DNA, or A — { 0, 1 } to deal with binary sequences. Generic elements of 
A e will be denoted by the letters u,v,w. A population is an m-tuple of 
elements of A . Generic populations will be denoted by the letters x, y, z. 
Thus a population a; is a vector 



whose components are chromosomes. For i£ {l,...,m}, we denote by 



x = 




x(i,l),---,x(i,£) 
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the letters of the sequence x(i). This way a population x can be represented 
as an array 

(x{l,l) ■■• x(l,£)' 



\x(m, 1) • • • x(m, I) j 

of size m x £ of elements of A, the i-th line being the i-th chromosome. The 
evolution of the population is random and it is driven by two antagonistic 
forces: replication and mutation. 

Replication. The replication favors the development of fit chromosomes. 
The fitness of a chromosome is encoded in a fitness function 

A: A 1 -> [0,+oo[. 

With the help of the fitness function A, we define a selection function 
F : A 1 x (A e ) m ->■ [0, 1] by setting 

Vu&A 1 Vxe(A e ) m 

F{U,X) = A(x(l)) + ■ ~ + A(x(m)) i lx(l)=u ■ 

The population x being fixed, the values F(u, x), u £ A e , define a probabil- 
ity distribution over A 1 . The value F(u, x) is the probability of choosing u 
when sampling from the population x. 

Mutation. The mutation mechanism is the same for all the loci, and 
mutations occur independently. We denote byg G]0, 1— 1/k[ the probability 
that a mutation occurs at one particular locus. If a mutation occurs, then 
the letter is replaced randomly by another letter, chosen uniformly over the 
k — I remaining letters. We encode this mechanism in a mutation matrix 

M(u,v), u,veA e , 

where M(u, v) is the probability that the chromosome u is transformed by 
mutation into the chromosome v. The analytical formula for M(u,v) is 



M(u,v) = Y[ ( (1 - q)l u(3)=vU) + 



Transition matrix. We consider the classical Wright-Fisher model. In 
this model, generations do not overlap. The mechanism to build a new 
generation is divided in two steps. In the first step, m chromosomes are 
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sampled with replacement from the population. The sampling law is given 
by the selection function. In the second step, each chromosome mutates 
according to the law specified by the mutation matrix. For n > 0, wc denote 
by X n the n-th generation. The Wright-Fisher model is the Markov chain 
(X n ) ne ^ on the space (A ) whose transition matrix is given by 

VneN Vx,ye(A e ) m 

P(X n+1 = y | X n = x) = Yl ( £ F(u, x) M(u, . 

l<i<m u£A e 

The other non diagonal coefficients of the transition matrix are zero. The 
diagonal terms are chosen so that the sum of each line is equal to one. 



3 Main results. 

We present the main results in this section. 

Sharp peak landscape. We will consider only the sharp peak landscape 
defined as follows. We fix a specific sequence, denoted by w* , called the 
wild type or the master sequence. Let a > 1 be a fixed real number. The 
fitness function A is given by 



Vit G A e A(u) = 



1 if u 7^ w* 
a if u = w* 



Density of the master sequence. We denote by N(x) the number of 
copies of the master sequence w* present in the population x: 

N(x) = card { i : 1 < i < m, x(i) = w* } . 

We are interested in the expected density of the master sequence in the 
steady state distribution of the process, that is, 

Master(er,€, m,p) = lim e( — N(X n j) , 

as well as the variance 

Variance ( a, £, m,p) = lim E ( ( — N(X n ) — Master(cr, t , m,p) 
n— >oo y\m 

The ergodic theorem for Markov chains ensures that the above limits exist. 
We denote by I(p, t) the rate function governing the large deviations of the 
binomial law of parameter pg [0,1], given by 

Vie [0,1] I(p,t) = tln- + (l-t)ln— — . 



6 



We define, for a G]0, +00 [, 



- if aer a > 1 



P*(a) = { a-1 

if aer a < 1 



l-l 



*(«) = inf inf { £/( _ °» +lk l(e 



a Pk+l 



k=0 



. [a - 1 )pk + 1 / V 7fc 

po = p*(a), pi = 0, /3 fc , 7^ G [0, 1] for < k < I } . 

The function "J is finite positive on ]0,ln<r[ and it vanishes on [In a, +00 [. 

Theorem 3.1 We suppose that 

i — > +00 , m — ^ +00 , q —t , 

in such a way that 

TTl 

iq -> a G]0, +oo[ , — - -> a G [0, +00] . 

We have the following dichotomy: 

• If aip(a) < In k then Master(cr, £, m, q) — > 0. 

• If aip(a) > In n then Master(er, £, m, q) — > p*(a). 
In both cases, we have Variance(<7, £, m, q) — > 0. 

The statement of the theorem holds also in the case where a is null or 
infinite, but a must belong to ]0,+oo[. This result is very similar to the 
result for the Moran model. Therefore all the comments done for the Moran 
model apply here as well. The main difference is that the function vE'(a) is 
more complicated. While we could obtain an explicit formula in the case of 
the Moran model, here the function ^(a) is the solution of a complicated 
variational problem. The general structure of the proof is similar to the 
one for the Moran model. We use the lumping theorem to reduce the size 
of the state space. We couple the lumped processes with different initial 
conditions. The coupling for the occupancy process is monotone. We 
construct then a lower and an upper process. These processes behave like 
the original process in the neutral region and like a Wright-Fisher model 
with two alleles whenever the master sequence is present in the population. 
The dynamics of these models is analyzed with a specific implementation 
of the Frcidlin-Wentzcll theory. We compute estimates of the persistence 
time of the master sequence, as well as its equilibrium density. Although 
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the results are similar to the case of the Moran model, this part is much 
more technical in the case of the Wright-Fisher model. Indeed, in the case 
of the Moran model, we needed simply to estimate some explicit formula 
associated to the birth and death model introduced by Nowak and Schuster 
[2U] . The approach used here to handle the Wright-Fisher model is quite 
robust and it should work for other variants of the model. In the final 
section we analyze the discovery time of the master sequence. This part is 
similar to the case of the Moran model, it is even simpler. 

4 Lumping 

We denote by dn the Hamming distance between two chromosomes: 
Wu, v £ A 1 d H (u, v) = card { j : 1 < j < C, u(j) ^ v(j) } . 
We define a function H : A 1 — > { 0, . . . , £ } by setting 
Vit G A 1 H{u) = d H (u,w*) . 
We define further a vector function H : (A e ) m — > { 0, . . . , I } m by setting 

H(x(l)) 




V.t = : g (A*) M(x) = 



4.1 Mutation and replication 

We state some results on the mutation matrix that have been proved in [3] . 
The mutation matrix is lumpable with respect to the function H. Let 
b,c £ { 0, . . . ,£ } and let u G A 1 such that H(u) = b. The sum 

H(w)=c 



does not depend on u in H ({&}), it is a function of b and c only, which 
we denote by Mn(b, c). The coefficient Mu(b, c) is equal to 




o<z<& 

k—l—c—b 
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The fitness function A of the sharp peak landscape can be factorized 
through H. If we define 



V6e{0,...,n A H (b) = 

then we have 



a if b = 
1 if b > 1 



VueA' A(u) = A H (H(u)). 

The selection function F can also be factorized through H . We define a 
selection function F H : { 0, . . . , I } x { 0, . . . , £ } m -> [0,1] by setting 

Vfc 6 {0, ...,£} Vde{0,...,£} m 

F H (k,d) = AHm) + H{k +AH{d{m)) E ^0=*- 

l<i<m 

We have then 

Vke{0,...,£} Vxe(A e ) m E F(u,x) = F H (k,M(x)) . 

ueA e 

H(u)=k 

4.2 Exchangeability 

The symmetric group & m of the permutations of {l,...,m} acts in a 
natural way on the populations through the following group operation: 

Vx e (A e ) m Vpe& m Vj G { 1, . . . , m } ( P - x)(j) - x(p(j)) . 

A probability measure p on (A e ) m is exchangeable if it is invariant under 
the action of & m : 

Vp G & m Vx € (A) p(p ■ x) = p(x) . 

A process (X n ) n >Q with values in (A e } m is exchangeable if and only if, for 
any n > 0, the law of X n is exchangeable. 

Lemma 4.1 The transition matrix p is invariant under the action of © m : 
Va;,yG(^) m Vp e 6 m p(p ■ x, p ■ y) = p(x,y) . 
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Proof. Let p € @ m and let x,y € (A e ) m . We have 

ze(A e ) m l<i<m 

= E II (^(p-^CO.p-^AfCp-^O.p-yW)) 

zG(*4 f ) m l<i<m 

ze^')™ l<i<m 

Thus the matrix p satisfies the required invariance property. □ 

Corollary 4.2 Let p be an exchangeable probability distribution on the 
population space (A) . The Wright-Fisher model (X n ) n > starting with 
p as the initial distribution is exchangeable. 

4.3 Distance process 

We define the distance process (D n ) n >o by 

Vn > D n = M(X n ) . 

We prove next that the Markov chain (J„) n >o is lumpable with respect to 
the partition of (A e ) induced by the map H, so that the distance process 
(Z?„)„>o is a genuine Markov chain. 

Proposition 4.3 (H Lumpability) Let p be the transition matrix of the 
Wright -Fisher model. We have 

Vee{0,...,n m Vx,ye(A e ) m , 

H(x)=H(y) =► P( x > z )= E P^' 2 )- 

ze(A e ) m ze(A e ) m 

H(z)=e H(z)=e 
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Proof. Let d, e £ { 0, . . . , I } m and let x £ (A e ) m be such that H(x) = d. 
We have 

J2 p(x,z)= E II (E F M) M M))) 

H(z)=e H(z)=e 

= II (E'M E M M) 

l<i<m ueyl f i>e.4 £ 

H(,;)=e(i) 

= II ( E i^)M ff (tf(u),e(i))) 

l<z<m 

= II ( E E F( U) x)M H (k,e(i))) 

l<i<m fce{0, ...,£} uGA" 
H(u)=k 

= II ( E F H (k,W(x))M H (k,e(i))y 

l<i<m k£{0,...,e} 

This quantity is a function of H(x) = (i and e only. □ 

We apply the classical lumping result to conclude that the distance process 
(D n )„>o is a Markov chain. From the previous computations, we see that 
its transition matrix pn is given by 

Vd,e G {0,...,<?} m 

p H {d,e) = J] ( E MM^MM*)))- 

l<i<m fcg{0,...,^} 

4.4 Occupancy process 

We denote by Vgl 1 the set of the ordered partitions of the integer m in at 
most I + 1 parts: 

V? +1 = { (o(0), . . . , o(0) € t^ +1 : o(0) + • • • + o(£) = m } . 

These partitions are interpreted as occupancy distributions. The partition 
(o(0), . . . , o(£)) corresponds to a population in which o(l) chromosomes are 
at Hamming distance I from the master sequence, for any / £ { 0, . . . , I }. 
Let O be the map which associates to each population x its occupancy 
distribution 0(x) = (o(x, 0), . . . , o(x, £)), defined by: 

V/ G {0,...,^} o(x,l) = cardji : 1 < i < m, d H (x{i) 7 w*) = 1} . 

The map O can be factorized through H. For d £ { 0, . . . , i } m , we set 

on(d, I) = card { i : 1 < i < m, = I } 
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and we define a map Oh ■ { 0, . . . , £ } m — > PE^ by setting 
H {d) = (o H (d,0),...,o H (d,£)). 

We have then 

Vxg {A l ) m O(x) = O h (M(x)) . 

The map O lumps together populations which are permutations of each 
other: 

\fx g {A l ) m Vp £ & m O(x) = 0{p ■ x) . 
We define the occupancy process (O„)„>o by setting 

Vn>0 O n = 0(X n ) = H {D n ). 



Proposition 4.4 (O Lumpability) Let pn be the transition matrix of 
the distance process. We have 



PH(eJ). 

/e{o,...,£} m 

OhU)=o 

Proof. Let o £ V^ +1 and d, e £ { 0, . . . ,£ } m such that H (d) = H (e). 
Since Ou(d) = 0#(e), then there exists a permutation p £ & m such that 
p ■ d = e. By lemma 14. 1\ the transition matrices p and pu are invariant 
under the action of & m , therefore 



Vo£P;" +1 Wd,e £ {0,...,£} m , 

H (d)=0 H (e) PH(d,f) = 

fe{o,...J} m 

OhU)=o 



X] PH(d,f)= X p H (p-d,p-f) 
fe{o,.. .,£}"> /e{(v..,n m 

OhU)=o OhU)=o 

Y Pn(eJ) = PH(eJ) 
/6{0,. ..,£}"* /e{o,...,^} m 

as requested. □ 

We apply the classical lumping result to conclude that the occupancy pro- 
cess (O n )n>o is a Markov chain. Let us compute its transition probabilities. 
We define a selection function Fo : {0, ...,£} x Vg] rl — > [0, 1] by setting 

Vfce{0,...,£} V GPf +1 F {k,o) - 



Eo<? 1 <£°( /l ) 
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Wc have then 

VfcG{0,...,n Vd€{0,...,l} m F {k,0 H {d)) = F H (k,d). 
Let o e Vf +X and d e { 0, . . . , I } m be such that O h (d) ^ o. We write 

PH(d,e)= J2 II ( E F H (k,d)M H (k,e(i))) 

e<E{0,...,i} m ee{0,...,l} m l<i<m k£{0,...,£} 

H (e)=o H (e)=o 

= n ( e F (k,o H (d))M H (k,h)y {h) . 

o<h<e ke{o,-,£} 

The last quantity depends only on Oh id) and o as requested. We conclude 
that the transition matrix of the occupancy process is given by 

Vo,o' G VT+i Po(o,o') = II ( E F (k,o)M H (k,h)y ih) 

o<h<e fee{o,.../} 



5 Monotonicity 

A crucial property for comparing the Wright-Fisher model with other pro- 
cesses is monotonicity We will realize a coupling of the lumped Wright- 
Fisher processes with different initial conditions and we will deduce the 
monotonicity from the coupling construction. All the processes will be 
built on a single large probability space. We consider a probability space 
(fl, J- ', P) containing the following collection of independent random vari- 
ables, all of them following the uniform law on the interval [0, 1]: 

U% j , n>l, 1 < i < m , 1 < j < I , 
S l n , n > 1 , 1 < i < m . 

To build the coupling, it is more convenient to replace the mutation prob- 
ability q by the parameter p given by 

K 

P = 7 <?• 

K — 1 

5.1 Coupling of the lumped processes 

We build here a coupling of the lumped processes. We set 

/ S" 1 TT 1 ' 1 IT 1 ' 1 - \ 
Vn > 1 R n = : : ... : . 

\ Cm Trm,l Tjm,£ I 
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The matrix R n is the random input which is used to perform the n-th 
step of the Markov chains. We denote by 1Z the set of the matrices of size 
m x (£ + 1) with coefficients in [0, 1]. The sequence (R n )n>i is a sequence 
of independent identically distributed random matrices with values in 1Z. 

Mutation. We define a map 

Mb:{0,...,£}x [0,l] £ ^{0,...,n 

in order to couple the mutation mechanism starting with different chromo- 
somes. Let b € { 0, . . . ,£ } and let m, ■ ■ ■ , ui £ [0, l] e . The map Mh is 
defined by setting 

b i 

Miiib, ui,...,Uj) = b — l Uk < P /K + l« fc >i-p(i-i/K) ■ 

k=l k=b+l 

The map Mh is built in such a way that, if Ui , . . . , Ug are random variables 
with uniform law on the interval [0, 1], all being independent, then for any 
b e { 0, . . . ,£}, the law of Mn(b, Ui, . . . , Ug) is given by the line of the 
mutation matrix Mh associated to b, i.e., 

Vce {(),...,£} P(M H (b,U 1 ,...,Ug) =c) = M H (b,c). 

Selection for the distance process. We realize the replication mecha- 
nism with the help of a selection map 

S H :{0,...,£} m x [0,1] ->{!,•■-, "*}• 

Let d e { 0, . . . , £ } m and let s E [0, 1[. Wc define S H (d, s) = i where i is 
the unique index in { 1 , . . . , m } satisfying 

A„(d(l)) + ■■■ + A H (d(i - 1)) A H (d(l)) + ■■■ + A H (d(i)) 

< s < 



A H (d(l)) + ■ ■ ■ + A H (d(m)) ~ A H (d(l)) + --- + A H (d(m))- 

The map Sh is built in such a way that, if S is a random variable with 
uniform law on the interval [0, 1], then for any { 0, . . . , £ } m , the law of 
Snid, S) is given by 

Vze{l,...,m} P{S H (d,S) = i) = 



A H (d(l)) + --- + A H (d(m)) ' 
Coupling for the distance process. We build a deterministic map 
* H :{0,...,£} m x K^r {0,...,£} m 
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in order to realize the coupling between distance processes with various 
initial conditions. The coupling map ^ h is defined by 

VreTZ, Vde{0,...,£} m 

/ M H (d(S H (d,r(l,l))),r(l,2),...,r(l,£+l))) 
V„{d,r) = : 

\M H (d(S H (d, r(m, 1))), r(m, 2), . . . , r(m, * + 1))), 

The coupling is then built in a standard way with the help of the i.i.d. 
sequence (R n ) n >i and the map ^>h- Let d G { 0, . . . , £ } m be the starting 
point of the process. We build the distance process (D n ) n >Q by setting 
D = d and 

Vn>l A, = * ff (A-i,i?„). 

A routine check shows that the process (-D ra )n>o is a Markov chain starting 
from d with the adequate transition matrix. This way we have coupled the 
distance processes with various initial conditions. 

Selection for the occupancy process. We realize the replication mech- 
anism with the help of a selection map 

So :VF+iX [0,1] ^{0,. ..,£}■ 

Let o G VY\-i and let s G [0, 1[. We define So(o, s) — I where I is the unique 
index in { 0, . . . , £ } satisfying 

o(0)A H (0) + ■ • • + o(l - l)A H (l - 1) < o(0)A H (0) + ■■■+ o(l)A H (l) 



o(0)A H (0) + • • • + o{£)A H {£) o(0)i ff (0) + ■ ■ • + o{£)A H {£) ' 

The map So is built in such a way that, if S is a random variable with 
uniform law on the interval [0, 1], then for any o G 'P£J. 1 , the law of So(o, S) 
is given by 

Coupling for the occupancy process. We build a deterministic map 

* G : VT +1 xTZ^ vr+x 

in order to realize the coupling between occupancy processes with various 
initial conditions. The coupling map $ o is defined by 

Vr G 11 , Vo G VT+i 

( M ff (5o(o,r(l,l)),r(l,2),...,r(l,€ + 1))) 



*o(o,r) = O h 



\Mh(So(o, r(m, 1)), r(m, 2), . . . , r(m, * + 1))), 
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Let o e be the starting point of the process. We build the occupancy 
process (O n )„>o by setting Oq = o and 

Vn>l O n = * (On-l,-Rn) • 

A routine check shows that the process (O n )n>o is a Markov chain starting 
from o with the adequate transition matrix. This way we have coupled the 
occupancy processes with various initial conditions. 

5.2 Monotonicity of the model 

We first recall some standard definitions concerning monotonicity and cor- 
relations for stochastic processes. A classical reference is Liggett's book 
[T5] . especially for applications to particle systems. In the next two defini- 
tions, we consider a discrete time Markov chain {X n ) n > with values in a 
space £ . We suppose that the state space £ is finite and that it is equipped 
with a partial order <. A function / : £ — > M is non-decreasing if 

Vx,y€£ x<y => f(x) < f(y) . 



Definition 5.1 The Markov chain (X n ) n > is said to be monotone if, for 
any non-decreasing function /, the function 

xe£^E(f(X n )\X = x) 

is non-decreasing. 

A natural way to prove monotonicity is to construct an adequate coupling. 

Definition 5.2 A coupling for the Markov chain (A„)„>o is a family of 
processes (A"^)„>o indexed by x G £, which arc all defined on the same 
probability space, and such that, for x £ £, the process (AT^)„> is the 
Markov chain (X n ) n >o starting from Xq = x. The coupling is said to be 
monotone if 

Vx,ye£ x<y => Vn > 1 X* < X% . 

If there exists a monotone coupling, then the Markov chain is monotone. 
The space { 0, . . . , £ } m is naturally endowed with a partial order: 

d<e <^=> Vie{l,...,m} d{i) < e(i) . 
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The map Mh is non-decreasing with respect to the Hamming class, i.e., 

V6,cG {0, ...,£} V«i,...,u/ G [0,1] 

b<c => MhQ>, ui, . . . ,ue) < Mh(c,Ux, . . . ,ug) . 

(See [3] for a detailed proof). In the neutral case a = 1, the map Sh does 
not depend on the population, in fact, 

Vd e { 0, . . . , £ } m Vs G [0, 1] S H (d, s) = \ms\ . 

As a consequence, we have 

Vd,e G {0,...,£} m VsG [0,1] 

d<e d(S H {d,s)) < e(S H (e,s)) . 



Lemma 5.3 In the neutral case a = 1, the map VP h is non-decreasing 
with respect to the distances, i.e., 

Vd, e G { 0, . . . ,£} m Wen, d<e => V H (d, r) < # H (e, r) . 

Proof. Let r G TZ and let d, e G { 0, . . . ,£ } m , d < e. Let i G { 1, . . . , m}. 
Since 

S H {d,r(i,l)) = S H (e,r(i,l)) = [mr(i, 1)J , 

then 

d(5 H (rf,r(i,l))) < e(5 H (e,r(i,l))). 
This inequality and the monotonicity of the map Mh imply that 

M H (d(S H (d, r(i, 1))), r(i, 2), . . . , r(i, I + 1))) 

< ^(e^Ccr^, l))),r(i,2), . . .,r(i,* + 1))) • 

Therefore ^n(d, r) < ^n(e, r) as requested. □ 

Corollary 5.4 In the neutral case <r = 1, the distance process (-D n )ra>o is 
monotone. 

Unfortunately, the map VP h is not monotone for a > 1. Indeed, suppose 
that 

K = 3 , cr = 2, to = 3, £ > 2 , 

2 3 3 3 

3 < Sl < 4 ' 4 < S2<1, 4 < S3 < 1 ' 



Vi g { 1,2, 3} Vje{i,...,£} u^-e 
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P 1 _ 2p 

3' 3 




then 

This creates a serious complication. To get around this problem, we lump 
further the distance process in order to build the occupancy process. It 
turns out that the occupancy process is monotone even in the non neutral 
case. We define an order ^ on 'P^_ 1 as follows. Let o — (o(0), . . . ,o(£)) 
and d = (o'(0), . . . , o'{t)) belong to VJ\ X . We say that o is smaller than or 
equal to o', which we denote by o -< o', if 

V/ < I o(0) + • • • + oil) < o'(0) + • • • + o'(Z) . 

As shown in [3], the map So is non-increasing with respect to the occu- 
pancy distribution, i.e., 

Vo, d G Vs G [0, 1] 

o<d Sq(o,s) > So(o',s). 



Lemma 5.5 The map ^o is non-decreasing with respect to the occupancy 
distributions, i.e., 

Vo,d GV^ VreTZ o<d *o(o,r) < *o(o',r). 

Proof. Let r E 1Z and let o, o' G 'PJ'+i be such that o ^ o'. Using the 
monotonicity of the map So, we have 

Vie{l,...,m} 5 (o,r(i,l)) > <S (V, r(i, 1)) . 
This inequality and the monotonicity of the map Mh imply that 

Vie{l,...,ro} M H (S (o,r(i,l)),r(i,2),...,r(i,e + l))) 

> M H {S (d,r(i, l)),r(i,2), . . . ,r(i,*+ 1))) . 

Therefore "^(o, r) < ^©(o', r ) as requested. □ 
Corollary 5.6 The occupancy process (O„)„>o is monotone. 
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5.3 The FKG inequality 

We consider here the product space { 0, . . . , £ } m equipped with the natural 
product order: 

d<e <^ Vie{l,...,m} d(i) < e(i) . 



Definition 5.7 A probability measure /i on { 0, . . . ,£} m is said to have 
positive correlations if for any functions /, g : { 0, . . . , £ } m — > K which are 
non-decreasing, we have 

J2 f{d)g{d)n{d)> ( J2 E 9(d) t*(d)). 

de{o,...,e}™ de{o,...4}™ de{o,...,i} m 

The Harris inequality, or the FKG inequality in this context, says that any 
product probability measure on { 0, . . . , £ } m has positive correlations. The 
FKG inequality is in fact true for any product probability measure on a 
product of the interval [0, 1] (see section 2.2 of Grimmett's book [H]). As 
far as correlations are concerned, there is not much to do with the original 
Wright-Fisher model, because its state space is not partially ordered. So 
we examine the distance process. 

Proposition 5.8 Suppose that we are in the neutral case a = 1. If the 
law of Do has positive correlations, then for any n > 0, the law of D n also 
has positive correlations. 

Proof. The Wright-Fisher model (X n ) n >o can be seen as a probabilistic 
cellular automaton. Indeed, given the population X n = x at time n, the 
individuals (X n+ i(i),l < i < m) of the population at time n + 1 are 
independent. This still holds for the distance process. By corollary 15 .41 
the neutral distance process (-D n ) ra >o is monotone. Monotone probabilistic 
cellular automata preserve the FKG inequality. This is explained in detail 
by Mezic [T7] and it was first observed by Harris [T^] at the very end of his 
article on continuous time processes. Because the argument is very short, 
wc reproduce it here. Suppose that the initial law \i of Dq has positive 
correlations. Let f,g : { 0, . . . , £ } m M be two non-decreasing functions. 
For any d £ { 0, . . . , £ } m , the conditional law of D\ knowing that Dq = d 
is a product measure on { 0, . . . , £ } m , thus it satisfies the FKG inequality, 
whence 

We {0,...,£} m 

E(f{D 1 )g(D 1 ) \D = d) > E(f(D 1 ) \D = d) E(g(D 1 ) \D = d). 
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We integrate the inequality with respect to the initial law (i: 



£ E(f(D 1 )g(D 1 )\D = d) f i(d) > 
de{o,...,t} m 

E(f(D 1 ) | D = d)E(g(D 1 ) \D = d) fi(d) . 
de{o,...J} m 

Since (D n ) n >o is monotone, the maps 

d^{Q,...,Z} m ^E(f(D 1 )\D Q = d), 
de{0,...,£} m ^E(g(D 1 )\D Q = d), 

are non-decreasing. By hypothesis, the initial law fi has positive correla- 
tions, therefore 

Y, E(.f(D 1 ) | D = d)E(g(D 1 ) \D = d) n{d) > 
de{o,...,e }™ 

( J2 E(f(Di)\Do = d)fi(d))( E(g(D 1 )\D = d) f x(d)). 

de{o,...,i} m de{o,...,e} m 

The two above inequalities imply that the law of D\ satisfies the FKG 
inequality. We conclude by iterating the argument. □ 



6 Stochastic bounds 

In this section, we take advantage of the monotonicity of the map to 
compare the process (O n )n>o with simpler processes. 

6.1 Lower and upper processes 

We shall construct a lower process (O^)„>o and an upper process (O^) n >o 
satisfying 

Vn > Ol^On^Ol. 

Loosely speaking, the lower process evolves as follows. As long as there is 
no master sequence present in the population, the process (O^Jn^o evolves 
exactly as the initial process (O n )n>o- When the first master sequence 
appears, all the other chromosomes are set in the Hamming class 1, i.e., 
the process jumps to the state (1, m — 1, 0, . . . , 0). As long as the master 
sequence is present, the mutations on non master sequences leading to non 
master sequences are suppressed, and any mutation of a master sequence 
leads to a chromosome in the first Hamming class. The dynamics of the 
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upper process is similar, except that the chromosomes distinct from the 
master sequence are sent to the last Hamming class I instead of the first 
one. We shall next construct precisely these dynamics. We define two maps 
7i*,7ri : V r t +1 -> V%_ x by setting 

Vo g V?+ x n e (o) = (o(0), 0, . . . , 0, m - o(0)) , 

tti(o) = (o(0),m-o(0),0,...,0) . 

Obviously, 

Vo g V'T +1 717(0) < o r< 7n(o). 

We denote by W* the set of the occupancy distributions containing the 
master sequence, i.e., 

W* = { o G 7^ : o(0) > 1 } 

and by A/" the set of the occupancy distributions which do not contain the 
master sequence, i.e., 

N = { o G VT +1 : o(0) = } . 

Let be the coupling map defined in section POl We define a lower map 
^ e by setting, for o G "P"+! and reK, 

!*o(o,r) if oGJV and * (o,r)^W* 

7T^(*o(o,r)) ifoeA/" and * (o,r)eW* 

717 (<f (717 (o),r)) if o G W* 

Similarly, we define an upper map by setting, for o G T^+x an< ^ r ^ 7?., 

!*o(o, r) ifoeA/" and * (o, r)^W* 

7ri(^ (o,r)) ifoeA/" and #0(0, r) e W* 

7Ti (#o(7ri(o), r)) ifoeW* 

A direct application of lemma l5.5l viclds that the map ^>q is below the map 
\l/o an d the map \l/ is above the map #0 m the following sense: 

MreK MoeVT+i *o(o,r) ■< *o(o,r) ^ # (o,r) . 

We define a lower process (O^) n >o and an upper process (0* )„>o with the 
help of the i.i.d. sequence (R n ) n >i and the maps \P , # as follows. Let 
o e VY+\ be the starting point of the process. We set Oq=Oq = o and 

Vn > 1 O e n = ¥ (pi_ x ,R n ) , Ol = ^oipl^Rn) . 
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Proposition 6.1 Suppose that the processes (O^) n >o, (O n ) n >o, (O^)„>o, 
start from the same occupancy distribution o. We have 

Vn > O l n ^O n < 0\ . 

Proof. We prove the inequality by induction over n £ N. For n = we 
have Oq = Oq = Oq = o. Suppose that the inequality has been proved at 
time n € N, so that O l n -< O n < 0\. By construction, we have 

On+l - *o(OtRn) , O n+1 = * O (0„ ;J R„) , 0\ +1 = VhiOlRn) ■ 

We use the induction hypothesis and wc apply lemma 15.51 to get 

Yet the map \Pq is below the map and the map vPq is above the map 
\&o, thus 

Putting together these inequalities we obtain that O l n+1 < O n+ i ^ 0} l+l 
and the induction step is completed. □ 

6.2 Dynamics of the bounding processes 

We study next the dynamics of the processes (O^)„>o and (O^)„>o in W*. 
The computations are the same for both processes. Throughout the section, 
we fix 9 to be either 1 or I and we denote by (O^)„>o the corresponding 
process. For the process (O^) n >o, the states 

T" = {»€ Vf+i : o(0) > 1 and o(0) + o{9) < m } 

are transient, while the populations in M U (W* \ T ) form a recurrent 
class. Let us look at the transition mechanism of the process restricted to 
W*\T 9 . Since 

W*\T 9 = { o e VT+i ■ o(0) > 1 and o(0) + o(0) = m } , 

we see that a state of W* \ T is completely determined by the first occu- 
pancy number, or equivalcntly the number of copies of the master sequence 
present in the population. From the previous observations, we conclude 
that, whenever (O^) n >o starts in W* \ T e , the dynamics of the number 
of master sequences (O^(0))„>o is markovian until the time of exit from 
W* \ T 6 '. We denote by [z^) n > a Markov chain on { 0, . . . , m } with the 
following transition probabilities: for h G { 1, . . . , m } and k £ { 0, . . . , m }, 

Vn > P{Z e n+1 = k | Z e n =h)= P{O e n+1 (0) = k | O e n (0) = h) , 
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and for h = and fee { 0, . . . , m }, 
Vn > P(Z« +1 = k | ^ = 0) = 



M H {e,o) k (i- m h (o,o)) 



m—k 



Let us denote by p 9 (h, k) the above transition probability and let us com- 
pute its value. We use the definition of the transition mechanism of (O„) n >o 
to get 

i 

P e (h,k)= Y,p 9 (.wm 

ie{0,...,m} j=Q 

where p e (h, i, j, k) is given by 



p e (h,i,j,k) = P 



Zi = h 



( i master sequences are selected , 
j master sequences do not mutate, 
k — j non master sequences 
V mutate into a master sequence 

m~\ (vh)'(m-k) m -' ^ M „ (0 , 0)J(1 _ M „ (0 , 0)) 



) 



*J ((a-l)h + m) m V7, 

The Markov chain (^„)n>o corresponds to the evolution of the number 
of master sequences in a Wright-Fisher model with two types, the master 
type having fitness a and the other type having fitness 1, and with the 
following mutation matrix between the two types: 

P(the master type mutates into the non master type) = 1 — Mh (0,0) , 
P(the non master type mutates into the master type) = Mh(9, 0) . 

We can also realize the Markov chain (Z^)„>o on our common probability 
space. We define two maps H , S 1 : { 0, . . . , m } — > V^li by setting 

Mi G { 0, . . . , m } 

S (£) = (i, 0, . . . , 0,m — i) , 5 1 (i) = (i, m — i, 0, . . . , 0) . 

Let i € { 0, . . . , m } be the starting point of the process. We set Z® = i and 

Vn>l Z e n = ^(S^.^i^O). 

This construction yields a Markov chain (Z^)„>o starting from i with the 
adequate transition matrix. Moreover the maps H , S 1 are non-decreasing. 
By lemma 15.51 the map is a ls° non-decreasing with respect to the 
occupancy distribution. We conclude that the above coupling is monotone. 
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Corollary 6.2 The Markov chain (Z^) n >o is monotone. 
6.3 Invariant probability measures 

Our goal is to estimate the law v of the fraction of the master sequence in 
the population at equilibrium. The probability measure v is the probability 
measure on the interval [0, 1] satisfying the following identities. For any 
function / : [0, 1] -> R, 

/ fdv = lim f(-N{X n )) = V f(-N(xj) a*(x) 
J[0,i] ™ Km 1 x6 ££ )m Vto 1 

where /x is the invariant probability measure of the Markov chain (X n ) n >o 
and N(x) is the number of copies of the master sequence w* present in the 
population x: 

N(x) = card { i : 1 < i < m, x(i) — w* } . 
In fact, the probability measure v is the image of [i through the map 

xe (A e ) m ^-N(x) G [0,11. 

m 

We denote by /j, , no , l^o ^ ne invariant probability measures of the Markov 
chains (0^)„>o, {O n )n>o, (Oi)n>o- The probability v is also the image of 
Ho through the map 

G VT+i h> -o(0) G [0, 1] . 
m 

Thus, for any function / : [0, 1] — >• M, 

/ /di/ = V /(£<°lU ( ) = lim /(io„(0)) . 

We fix now a non-decreasing function / : [0, 1] — >• R such that /(0) = 0. 
Proposition 16. II yields the inequalities 

Vn > f(-O ( n (0)) < /(-O«(0)) < /(-Oi(0)) . 
Taking the expectation and sending n to oo, we get 

E /(^Vo(o)< / fdu< E /(^U(o)- 
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We seek next estimates on the above sums. The strategy is the same for the 
lower and the upper sum. Thus we fix 9 to be either 1 or £ and we study 
the invariant probability measure (Xq. For the Markov chain (O^) n >o, 
the states of T e arc transient, while the populations in Afu (W* \ T ) 
form a recurrent class. Let o* xit be the occupancy distribution having m 
chromosomes in the Hamming class 8: 



V/e{0,...,n oL t (0 



m if I = 9 
otherwise 



The process (O^)„>o always exits W* \ T B at o® xit . This allows us to 
estimate the invariant measure with the help of the following renewal result. 



Proposition 6.3 Let (X n ) n >o be a discrete time Markov chain with values 
in a finite state space £ which is irreducible and aperiodic. Let (i be the 
invariant probability measure of the Markov chain (X n ) n >Q. Let W* be a 
subset of £ and let e be a point of £ \ W* . Let / be a map from £ to Hi 
which vanishes on £ \ W* . Let 

t* = inf { n > : X n e W* } , r = inf { n > t* : X„ = e } . 

We have 

This result is proved in detail in [3] . We apply the renewal result of propo- 
sition [O] to the process (O^) n >o restricted to Wu (W*\T e ), the set 
W* \ T e , the occupancy distribution of xit and the function o H> /(o(0)/m). 
Setting 

r* = inf { n > : O e n G W* } , 
r = inf { n > r* : = o c e xit } , 



we have 



4E/ 



O«(0) 



^0 ~~ "exit 



E{r\O 8 = o e exit ) 

Yet, whenever the process (O^)„>o is in W* \ T e , the dynamics of the 
number of master sequences (O^(0) )„>o is the same as the dynamics of 



25 



the Markov chain (Z^) ra >o defined at the end of section |6~^1 Let To be the 
hitting time of 0, defined by 

T = inf { n > : Z 6 n = } . 

The process (O^)„>o always exits W* \ at o^ x j t . Therefore t coincides 
with the exit time of W* \ T e after t* . Let i e { 1, . . . , m}. From the 
previous elements, we see that, conditionally on the event { O^»(0) = i\, 
the trajectory (O„(0), r* < n < r) has the same law as the trajectory 
{Z® n , < n < To) starting from Zq = i, whence 

£(r - r* | O*,(0) = i) = e(to I Zl = i) , 

<£/(^)l*<H-*(£'(£)l«-')- 

v n=T* 7 v n=0 7 

Conditioning with respect to O t * (0) and reporting in the formula for the 
invariant probability measure /Xq, we get 

E * ( E / (f ) | Z " = <) m* (0) = * I O e = <U) 

m 

E{T* I = f xit ) + J2 E ^0 I ^ = P (°r- (°) = * I = <4it) 
i=l 

We must next estimate these expectations. In section [7j we deal with the 
terms involving the Markov chain (Z^) n >o- In section [51 we deal with the 
discovery time t* . 

7 Approximating processes 

This section is devoted to the study of the dynamics of the Markov chains 
(Zn) n >o and (Zn) n >o- The estimates are carried out exactly in the same 
way for both Markov chains. As we said before, the Markov chain (Z^) n >o 
corresponds to the evolution of the number of master sequences in a Wright- 
Fisher model with two types. Throughout the section, we fix 8 = 1 or 8 = £ 
and we remove 8 from the notation in most places, writing simply p, Z n 
instead of p e ,Z^. 
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7.1 Large deviations for the transition matrix 

We first recall a basic estimate for the binomial coefficients. 



Lemma 7.1 For any n > 1, any k S { 0, . . . , n }, we have 



l n ix, — i \ i + A; In - + (n — k) In - — - 
k\{n — k)\ n n 



< 21nn + 3. 



Proof. The proof of this estimate is standard (see for instance [7]). 
Setting, for n £ N, 4>(n) = Inn! — nlnn + n, we have 



ln 



Inn! — In fc! — ln(n — k)\ 



M(n-k)\ 

n \nn—n+(p(n) — (fcln k—k+(j)(k)^ — ((n— k) ln(n— k) — (n— k)+<f>(n— fc)) 



-fc In (n — fc) ln 

n n 



+ cj>(n) - (f)(k) - (f>(n - k) . 



Comparing the discrete sum 



ln n! = hi k 



l<k<n 

to the integral 

r 

I In x dx , 

we see that 1 < 4>(n) < Inn + 2 for all n > 1. On one hand, 
</>(n) — 0(fc) — 4>(n — k) < Inn, 

on the other hand, 

(j){n) - cj)(k) - (f>(n -k) > 1 - (lnfc + 2) - (ln(n - k) + 2) > -3 - 2 Inn 

and we have the desired inequalities. □ 
For p, t € [0, 1], we define 

I(p,t) = fln- + (l-t)ln— —. 

P 1-P 



For t [0, 1], we set I(p,t) 
Wt e [0, 1] 



■foo. For p = 0, we have 
7(0, t) 



if t = 
+oo if t > 



The function I{p 1 ■) is the rate function governing the large deviations of 
the binomial distribution B(n,p) with parameters n and p. This is the 
simplest case of the famous Cramer theorem, which we recall next. 
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Lemma 7.2 Let X be a random variable whose law is the binomial dis- 
tribution B(n,p) with parameters n and p. For t £ [0, 1], we have 

lim -lnP(X = [nt\) = -I(p,t) . 

n— f oo fi 

Using the estimate of lemma 17.11 we have even 

V/ce {0,...,n} \nP(X = k) = -n/(p,-) +0(21nn + 3). 
We define two functions /, F : [0, 1] — s- [0, 1] by 

(<r — ljr + 1 
We define a function If : [0, l] 4 — > [0, +oo] by 

£(r, s, /?, i) = J(/(r), s) + s /(M ff (0, 0), + (1 - s)l(M H {0, 0), ^— ^) . 

This function depends on £ through the mutation probabilities Mh(0, 0) 
and Mh(0,0). Using the estimate provided by lemma [7TT1 and the expres- 
sion of p 9 , we see that 

Vh,i,j,k£ {0, ...,m} 



lnp(h,i,j,k) = -miff (-),-) -il(M H (0,0),l) 

V m—iJ 



-mlA —,—,—, — } + ®(h,i,j, k,m) 



.m m m m; 
where the error term Q(h, i, j, k, m) satisfies 

Vft, k £ { 0, . . . , m } |<fr(/i, i, j, fc,m)| < 61nm + 9. 

We consider the regime where 

£ -> +oo , m -> +oo , <?— !>0, ->• a e]0,+oo[. 

In this regime, we have for 6 = 1 or = £, 

Mff(0,0)^e-°, M H (0,O)^O, 

so that, for r, s, (3,t £ [0, l] 4 , 



It(r,s,/3,t) 



I(r,s,t) if/3 = i 
+oo if /3 ^ i 
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where the function I(r,s,t) is given by 



Vr,Me [0,1] 3 I(r,s,t) = /(/(r), S )+ S /(e- a ,-). 



Proposition 7.3 We define a function V\ on [0, 1] x [0, 1] by 

Mr, t G [0, 1] Vi (r, t) = inf | l(r, s, t) : s G [0, 1] } . 

The one step transition probabilities of (Z n ) n >o satisfy the large deviation 
principle governed by V\\ for any subset U of [0, 1], we have, for any n > 0, 

° 1 
- inf {Vi(r,t) :teU\ < lim inf — \nP(Z n+1 & mU \ Z n = \rm\ ) , 

l,m— yoo, q— >0 m 
iq^ra 

limsup —lnP(Z n+1 emU\Z n =[rm\) < - inf { Vi(r, t) : t G U } . 

£,m— >oo, (j— >0 TO 

Proof. Let r e [0, 1] and let U be a subset of [0, 1]. We have, for any 
n > 0, 

P{Z n+ i G mf/ | Z„ = |_rm|) = ^ p([rm\ , fc) 

feem(7n{ 0,...,m } 



ke{ 0,...,m} i=0 j=0 



From the previous inequalities, we have 



P(Z n+1 G mC/|Z„ = [rm\) 

< (m + l) 3 max | p(L rm J , J, fc) : < i < m, < j < i, k G ?ti£/ | 

n / . f . / 1 rmj i j k 

< m cxp I — m mm < lg I - 



< i < m, < j < i, k G m£/ 



For each m > 1, let i m , jm, k m be three integers in { 0, . . . , m } which realize 
the above minimum. By compactness of [0,1], up to the extraction of a 
subsequence, we can suppose that, as m goes to oo, 

f"m jm n km . 

>■ s, > (3 , > t. 

m m m 
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If /? < t then 

T / [rm\ i rn j m k m \ 
limsup -IA , — , — , < 

«,m-Kx>, g ->o \ m m m m J 

lq-^ra 

"*m jm 



limsup -fl-^)jfAf fi (fl,0), m m ) =-«>, 

,m-s.oo, g->0 ^ TO/ V J— — 



because 

jr. 



t "<m n 

lim sup In ■ 



(i-^W(0,o) 

V to / 

Thus we need only to consider the case where (3 = t. We have then 



hmsup -JW , — , — , — ) < —I{j(r),s) —sl[e ,- 

<,m->oo,«->-o V to to to to/ \ s 

This implies the large deviation upper bound: 
limsup — lnP(Z n+ i € mC7 | Z„ = |jtoJ) 

£,m— foo, g— >0 

< -inf {l(r,s,t) :se[0,l],teFj 
Conversely, let s,te [0, 1]. We have 

P(Z„+i = \ tm\ | Z n = \rm\) > p([rm\, [sm\, [tm\, \ tm\) 

1 / / I rm I I sm I I tm I I im I 
> -^exp -mlA± K~ 

77V V V TO TO TO TO 

> ^ exp ( - ^(/(M ), W ) _ [stoJ /(m*(0,0), JM) 

TO' \ V V TO / 777 / V [stoJ / 

— (TO — I STO I ) In ; ] 

1 L j ' 1-M H {9,0)J 
Taking In and sending m,£ to oo, we obtain 

lim inf — hi P(Z n+1 = \tm\ I Z n = \ rm\) > -l(r,s,t). 

e,m-H>o, q-yO TO v y 
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Suppose now that t belongs to U, the interior of U. For m large enough, 
the integer \ tm\ belongs to ml] . From the previous estimate, we have 

liminf — \nP(Z n+ i G mU I Z n = \ rm\) > —l(r,s,t) . 

Optimizing over s, i, we get the large deviation lower bound: 
lim inf — In P(Z n +i £ mU \ Z n = \ rm\) 

t,m->-oo,q^yO 171 



> -inf 



|/(r,s,t) : s £ [0,1], i e ?/} 



This finishes the proof of the large deviation principle. □ 

Proceeding in the same way, we can prove that the Z-step transition prob- 
abilities satisfy a large deviation principle. For I > 1, we define a function 
Vi on [0, 1] x [0, 1] by 

i-i 



k=0 



Vi(r,t) = inf | ^2l(p kl j kl p k+1 ) : p = r, pi = t, 

p fc ,7fce [0,1] for0<fc</}. 



Corollary 7.4 For I > 1, the /-step transition probabilities of (Z n ) n >o 
satisfy the large deviation principle governed by Vf. for any subset U of 
[0, 1], any r £ [0, 1], we have, for any n > 0, 

o i 
-inf \Vi(r,t) : t £ U) < liminf — kiP(Z n+ i £ mU \ Z n = I rml) , 

L J £,m->oo. q->0 771 V 

lim sup — In P(Z n+l £ mU | Z n = [rm\ ) < - inf { V t (r, t) : t € U } . 

t,m— >oo, <j->0 m 
iq-^ta 



The rate function for the one step transition probabilities is given by 

t - 
s> 

we see that 



r (r,s,t) = /(/(r),,s)+,s/(e- a ,^), 



I(r,s,t) = <=► s = f(r), e- a = - 

s 
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Therefore the Markov chain (Z„) n >o can be considered as a random per- 
turbation of the dynamical system associated to the map F: 

z e [0, 1] , Vn > 1 z n = F(«n_i) . 

Let us set 

- if aer a > 1 



p*(a) = p(e~ a ,0) = { a-1 

if aer a < 1 

If ae~ a < 1 the function F admits only one fixed point, O, which is stable. 
If ae~ a > 1 the function F admits two fixed points, O, which is unstable, 
and p*{a) which is stable. 

The natural strategy to study the Markov chain (Z n ) n >o is to use the 
Freidlin-Wentzell theory [9]. This theory has been initially developed in 
a continuous setting, for brownian perturbations of differential equations. 
This theory is quite robust and it can be applied well beyond this initial 
framework. A discrete version has been worked out by Kifer p~3j [14] . Un- 
fortunately, our model does not satisfy the initial hypothesis employed by 
Kifer. Indeed we have a degeneracy at and the convergence in the large 
deviation principle for the transition probabilities p is not uniform with re- 
spect to the starting point. In addition, the rate function might be infinite, 
for instance 

P(Z n+1 >^\Z n =0) = ]T (™)M H (6,0) k (l-M H (6,0)) m - k , 

k>m/2 ^ ' 

and since Mh(8, 0) goes to as £ goes to oo, we have 



lim -InP (Z n+1 > —\Z n = 0) 



-OO . 



For these reasons, the Markov chain (Z n ) n >o does not fit into the model 
studied by Kifer. Undoubtedly, it is possible to relax somehow the hy- 
pothesis of Kifer's model in order to include our case. However this would 
require a full check of the proofs and the final presentation would not be 
convenient. Moreover we do not need the general results concerning the 
perturbed dynamics in the presence of a finite number of attractors. In 
our case, we have one stable fixed point and one unstable fixed point. Our 
situation is similar to the case studied by Darden [3] and by Morrow and 
Sawyer [18) . In Dardcn's case, the mutations are not taken into account, 
so the model is slightly too restrictive. In their model, Morrow and Sawyer 
assume a lower bound on the variance of the process which is uniform with 
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respect to the starting point. This condition is violated here, because the 
variance vanishes at the unstable fixed point. In fact, we could possibly use 
the result of Morrow and Sawyer on a subset of the form { \8m\ , . . . , m } , 
where 8 > is fixed, but we would still need to study the process in the 
vicinity of 0. The other drawback is also that, apart from the hypothesis 
on the variance, their model is a bit too general for our purposes (they con- 
sider a Markov chain evolving in a bounded subset of It appears that, 
in any case, if we try to apply the results of Kifer or of Morrow and Sawyer, 
we have to make a specific study of our process in the vicinity of the unsta- 
ble fixed point 0, because we must control the hitting time of 0, uniformly 
over the starting point. Therefore we choose to make a self-contained proof 
of the estimates we need. We take advantage of the specific dynamics to 
make a simpler proof, namely we use the discrete structure of { 0, . . . , m } 
and the monotonicity of the model, instead of relying on compactness and 
a condition of uniform convergence. The crucial quantity to analyze the 
dynamics is the following cost function V. We define, for s,t £ [0,1], 

V(s,t) = inf Vi(s,t) = 
(-1 

inf inf j ^ l(p k , y k , p k+1 ) : p Q = s, pi = t, p k ,^ k £ [0, 1] for < k < I j . 

fe=0 



Lemma 7.5 Suppose that oe~ a > 1. For s,t £ [0, 1], we have V(s,t) = 
if and only if 

• either s = t = 0, 

• or there exists I > 1 such that t = F l (s), 

• or s 7^ 0, t = p*{a). 

Proof. Throughout the proof we write p* instead of p* (a). Let s,t G [0, 1] 
be such that V(s,t) = 0. Suppose first that s = 0. Since 1(0, j,p) = +oo 
unless 7 = p = 0, then any sequence (po, 7o, ■ • ■ , 7z) such that p = s = 
and 

(-1 

~Y^l{pk,lk,Ph+i) < +oo 

k=0 

has to be the null sequence, so that necessarily t = 0. We suppose next 
that s > 0. For each n > 1, let (pg , 7J , . . . , pf, n \) be a sequence of length 
l(n) in [0, 1] such that 

C(n)-1 
k=0 
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We consider two cases. If the sequence (Z(n)) n >i is bounded, then we can 
extract a subsequence 

( J>( n ) <K™) J>( n ) \ 

\Po '10 > ■ ■ ■ iPl(4,(n))> 

such that l(4>(n)) = I does not depend on n and for any k £ {0, ... ,1 — 1}, 
the following limits exist: 

hm p k y = p k , lim 7fc = 7fe ■ 



n— >oo n—*oc 



The map I being continuous, we have then 

Vfce{0,...,i-1} l(p k ,7k,Pk+i) = 0, 

whence 

Vfce{0,...,Z} Pk = F k (po). 

Since in addition po = s and pi — t, we conclude that t = F l (s). Suppose 
next that the sequence (l(ri)) n >i is not bounded. Our goal is to show that 
t = p* . Using Cantor's diagonal procedure, we can extract a subsequence 

n <t>( n ) \ 

\Po )7o ! ■ • ■ 'Pi(0(n))J 

such that, for any k > 0, the following limits exist: 

lim pt {n) = p k , lim 7^ (n) = j k . 

n— foo n— too 

The map / being continuous, we have then 

\fk > l(p k ^ k ,Pk+i) = 0, 

whence 

VA:>0 ^=^(^0). 

Let e > 0. We have 

I(p%f(p'),p*) = 0. 
The map / being continuous, there exists a neighborhood J7 of p* such that 

VpeC/ < I{p*J(p*),p) < e. 

Since s > 0, 

lim F n (s) = p* 

and 

3/i>l F h (s)eU. 
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lam pt w = F h (s) 6 U, 

n— fcso 



In particular, 

tpin 

- r 

o 

so that, for n large enough, 



and 



Letting successively n go to oo and e go to we obtain that V(p*,t) = 0. 
Let S G]0,p*/2[ and let U be the neighborhood of p* given by 

[/ = ]p*-*,p*+a[. 

Let a be the infimum 

a = inf { J(p ,7o,Pi) : Po G £/, 7o € [0, 1], pi ^ U } . 

The function / is continuous on the compact set U x [0, 1] x ([0, 1] \ U\ 
hence 

3(po.75»Pl) eFx [0,1] x ([0,1] a = /(pg, 70. Pi) - 

Since -F is non-decreasing and continuous, we have 

F(U) = F([p* -6,p* +6}) = [F(p*-6),F(p* + 6)}. 

Moreover we have 

p* - 5 < F{p* -8) < F{p* +5) < p*+S. 

Thus F(U) C U and necessarily p\ ^ F(pq) and a > 0. It follows that 
any sequence (po, 70, . . . , pi) such that 

(-1 

Po&U , ^2l(p k ,Jk,Pk+i) < a 

is trapped in U. As a consequence, a point i satisfying V(p*,t) = must 
belong to U. This is true for any 5 > 0, hence for any neighborhood of p* , 
thus t = p*. □ 

We shall derive estimates in the regime where 

£^+00, m— > +00 , q^0, £q ->• a e]0, +oo[ . 

Several inequalities will be valid only when the parameters are sufficiently 
close to their limits. We will say that a property holds asymptotically to 
express that it holds for £, m large enough and q small enough. 
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7.2 Persistence time 

In this section, we will estimate the expected hitting time To starting from 
a point of { 1, . . . , m }. This quantity approximates the persistence time of 
the master sequence w*. We recall that 

cre- a - 1 r 

*/ \ r -a n\ ; — if ere a > 1 

p*(a) = p{e a ,0) = \ (j 1 

if aer a < 1 

and that the functions /, F : [0, 1] — > [0, 1] are given by 

fir) = V ^- 1 , F(r) = e-/(r). 



Proposition 7.6 Let a G]0, +oo[ and let i G { 1, . . . , to}. The expected 
hitting time tq of starting from i satishes 

lim — \nE(T Q \Z = i) = V(p*(a),0). 

(,m—>-oo m 
q->0, Iq^a 

Proof. Before proceeding to the proof, let us explain the general strategy, 
which comes directly from the theory of Freidlin and Wentzell. To obtain 
the upper bound on the persistence time, we show that, starting from any 
point in { 1, . . . , to }, the probability to reach a neighborhood of in a finite 
number of steps is larger than 

exp ( — mV(p* , 0) — me\ . 

This way we can bound from above To by a geometric law with this param- 
eter (see lemma ITTT)) . To obtain the lower bound on the persistence time, 
we first show in lemma [7751 that . starting from any point, the process has 
a reasonable probability of reaching any neighborhood of p* before visit- 
ing 0. This estimate is quite tedious, because the process might start from 
Zq = 1, which is close to the unstable fixed point of F. Since we need 
to control the hitting time of starting from any point, such an estimate 
seems to be indispensable and it cannot be done in the more general situ- 
ations considered by Kifer [13] or Morrow and Sawyer JTS] without adding 
some extra assumptions. So we give a lower bound on the probability of 
following the iterates of a discrete approximation of F. With a Poisson 
fluctuation, the process jumps away from 0, then, because F is expanding 
in the neighborhood of 0, it reaches the point r\m after In to steps, for some 
7/ > 0, and with a finite number of steps, it lands in a neighborhood of p* . 
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We then study the excursions of the process outside a neighborhood of 
and p* . Whenever the process is outside such a neighborhood, it reenters 
the neighborhood in a finite number of steps with probability larger than 
1 — exp(— cm) for some c > depending on the neighborhood. Thus the 
process is very unlikely to stay a long time outside a neighborhood of the 
two attractors {0,p*}. In fact, the length of these excursions is bounded 
by a constant, up to negligible events. We consider the hitting time Ts of 
the ^-neighborhood of 0. Obviously we have r > tj. We look then at the 
portion of the trajectory starting at the last visit to the neighborhood of p* 
before reaching a neighborhood of 0. Such an excursion occurs at a given 
time with probability of order 

exp ( — mV(p* , 0) + me^j , 

therefore it is unlikely to occur before time exp (mV(p* , 0) — me). We 
start now with the implementation of this scheme. Throughout the proof 
we write p* instead of p*(a). We start by proving an upper bound on 
the hitting time. The next argument works in both cases oe~ a < 1 and 
ae~ a > 1. In the case ae~ a < 1, we have p* = and V(p*, 0) = 0, and the 
proof becomes simpler, there is no need to consider a path from p* to 0. 
Let e > 0. We have 

l(p*J(p*),p*) = 0. 
The map / being continuous, there exists 6 > such that 

Vpe] P * -d,p* + 8[ l(p,f(p),p*) < e. 

Moreover 

lim F™(1) = p* . 

n— »-oo 

Thus 

3h>l F h (l) e]p* -6,p*+S[. 
Let I > 1 and let (po, 70, . . . , pi) be a sequence in [0, 1] such that 

l-i 

Pa = P*, Pi =0, ^/(p fe ,7fc,p fc+ i) < V(p*,0) + e. 

We consider the sequence obtained by concatenating the two previous se- 
quences: 

to = 1,8 = f(l),t 1 = F(l),...,t h - 1 = F h - 1 (l),s h - 1 = /(i h _i), 
t h = F h (l),s h = f(t h ),t h+1 = p*,s h+1 = 70, 

th+2 = Pi,'" > tfl+l = Pl-l,Sh+l = 7i-l,ffc+j+l = Pi = . 
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We set j = h + 1 + 1 . This sequence satisfies 

t = l,t J - = > 5D/(*/fe,a*,**+i) <V( P *,0) + 3e. 

k=Q 

We have then 

P(Zj = | Z = m) > JJ p(Lmt fc J, [msfcj, [mtfe+ij, L mi fc+iJ) • 

fe=0 

Taking In, sending mtooo and using the estimate on the transition prob- 
abilities obtained in the proof of proposition 17.31 we have 



i-l 

lim inf 

t,m— Voo, q— >o m 



-\nP(Z 3 =0\Z Q = m) > -5^J(tfc,8fc,t fc +i) 

> -V(p*,0) - 3e. 



Thus, asymptotically, we have 

P(Zj =0\Z = m) > cxp ( - mV{p*,0) - 4mej . 

Using the monotonicity of the Markov chain (Z n ) n >o, we conclude that, 
asymptotically, 

Vi G { 1, . . . , m } P(Zj = | Zq = i) > exp ( - mV{p*, 0) - 4me 

We have thus a lower bound on the probability of reaching in j steps 
starting from any point in { 1, . . . , m }. For any n > 0, we have, using the 
Markov property, 



P(t >(n+ l)j | Z = m) = P ( T o > ( n + ^ Z «J = fc I Z ° = m ) 

h=l 

m 

= ^2 P(to > nj, Z nj = h, Z nj+1 ^ 0, ... Z {n+1]] ^0\Z Q = m) 

h=l 
m 

= ^2 p ( z nj+i 0, . . • Z {n+1)j ^ I r > nj, Z nj = h, Z = m) 

h=l 

x P(t > nj, Z n j = h\ Zq = m) 



= X! P ( To > -i\ z o = h ) p ( T o > n :h Znj = h\Z = m) 

h=l 

< (l - cxp ( - mV{p* , 0) - AmeU P(r > nj \ Z Q = to) 
Iterating this inequality, we obtain the following result. 
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Lemma 7.7 For any e > 0, there exists j > 1 such that 



Vn > P(t > nj \Z =m) < (l - exp ( - mV(p*,0) - 4me 




It follows that 

E(t I Z = m) = p { T o >k\Z = m) 

k>l 

(n+l)j 

= Y,J2 P ( T ° > k \ Z v= m ) 

n>0 k—nj-\-l 

< ^2jP(rQ >nj\Z = to) < j exp (mV(p* , 0) + 4toe) 

whence 

lim sup — In E(t \ Z = to) < "^(p* , 0) + 4e . 

Letting e go to yields the desired upper bound. We prove next a lower 
bound on the hitting time. If ae~ a < 1, then p* = 0, V(p*,0) = 0, and 
obviously 

liminf —\nE{T \Z = m) > V(p*,0) = 0. 

£,m— >oo, q— >0 TO 

Thus we need only to consider the case ae~ a > 1. We start by estimating 
from below the probability of going from 1 to a neighborhood of p* without 
visiting 0. Before proceeding with the mathematical details, let us explain 
the strategy to get this lower bound. When Zq = 1, the binomial law 
involved in the replication mechanism can be approximated by a Poisson 
law of parameter a, and the process (Z n ) n >o can jump to any fixed fteN 
with a probability larger than a positive quantity independent of m. Using 
a simple estimate on the central term of the binomial law, we have that 

P(Z n+1 = G m (h) | Z n = h)> ) 

(to + iy 

where G rn is a map from { 0, . . . , to } to { 0, . . . , to } such that 
— G m (h) > F (-)--. 

TO V TO / TO 

Therefore we study the iterates of the function F(x) — 1/m. This function, 
which is a small perturbation of F, has two fixed points, one unstable close 
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to 0, of order 1/m, and one stable close to p* . We take h large enough so 
that h/m is above the unstable fixed point. Then the repulsive dynamics 
of F(x) — 1/m will bring the point h/m close to a value r\ > (independent 
of to) in a number of iterates of order In to. Once the process (Z n ) n >o is at 
\r]m\ , a finite number of iterates leads into the neighborhood of p* . The 
lower bound is obtained by combining the three steps: 

(\ c In m+c 
7 To , 
(m + iy J 

where c is a constant independent of to. This is the idea of the proof of 
the next lemma. 



Lemma 7.8 For any 6 > 0, there exist too,c > 0, depending on S, such 
that, for to > toq, 



P{Z 1 > 0, . . . , Zl^j.! > 0, Z [clnmi > m(p* -6)\Z = 1) > 



c In m 



Proof. The binomial law B(n,p) of parameters n > and p < 1 is 
maximal at |_(^ + , therefore 

VL(n + l)pjJ P ^ p) * n +l- 

See for instance chapter VI in Feller's book [S]. We shall use this inequal- 
ity to bound from below the transition probabilities of the Markov chain 
(Z n ) n >Q. Let us define a map G m : { 0, . . . , to } —t { 0, . . . , to } by 



Vh€{0,...,m-1} G m {h) 



(L(m+l)/(A)j +1 )e- 



G m (m) - [(m + l)e- a \ . 

Applying the previous lower bound to the binomial laws involved in the 
transition step of (Z n ) n >o, we obtain 



Vn,h>0 P(Z n+1 > G m (h) | Z n = h) > 
(m + 1)/' 



p h, 



G m {h),G m {h)\ > 



1 



(m + If 
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It follows that for n, h > 0, 

P(Zi > (G m )\h), ...,Z n > (G m ) n (h) \Z = h) 

p ( z i > (G m )\h), ...,Z n _ x = I, Z n > (G m ) n (h) | Z = h) 

i>(G m )»-Hh) 

]T P(Z n >(G m ) n (h)\Z n ^=l) 

- (Gm) *" 1(,,) x P[Z X > (G m )\h), ...,Z n - 1 = l\Z = h) 

> P(Z n > (G m ) n (h) | Z„_ x = (G m ) n_1 (/i)) 

xP(Z x > (G TO ) 1 (/i), . . . , Z„_! > (G TO ) n_1 (/i) | Z = /i) 
1 



> 



r P(Zi > (G m ) 1 (fc),...,Vi > (G TO )"- 1 (/i)|Z =/i) 



(m + l) 2 ' 

Iterating this inequality, we obtain, for n,h > 0, 

P(Zx > (G m ) 1 (ft),...,Z„ > (G m ) n (JO|Z„ = Z0 > 



1 



(to + l) 2 " ' 

The map G m : { 0, . . . , to } —> { 0, . . . , m } is non-decreasing. Moreover 



V/ie{0,...,m} G m (/i) > 



(m + l)/(-)e-° 
Vto/ 



> to 



ft 

Vto 



a - 1 



Let us define a map P m : [0, 1] — > [0, 1] by 

Va-G [0,1] P m (x) = P(z)--. 

TO 

We can rewrite the previous inequality as 

Vfce {0,...,to} G m (h)>mH m (- 



Iterating this inequality, we get, thanks to the fact that both G m and H„ 
are non-decreasing, 



Vn > V/i£ 



{0,...,to} (G m ) n (h) > m(H m ) n (-) 

Vto/ 



The map H m , which is a small perturbation of the map P, has two fixed 
points p' m < p^, whose expansion as m goes to oo is given by 



1 



m(ae a — 1) 



ae~ a - 1 ae~ a 
a — 1 m(ae~ a — 1) 
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Let 77 > 0. If x < 77, we have F(x) > ax, where 



a 



(a - 1)?7+1 



For ?y sufficiently small, we have a > 1 and the map F restricted to [0,77] 
is expanding. Let us study the iterates of x through the map H m . We set 

N = inf {n>0:(H m ) n (x) > 77 } . 

For 1 < 77 < TV. we have 

(H m ) n (x) = HmdHmr-^x)) > a{H m ) n -\x)--, 

TCI 

which we rewrite as 

Summing from n = 1 to N — 1, we get 

(H m ) N -\x) > /-{.-l^) * ^('--T-Ti 
\ m /L -~' o or j V m(a — 1) 

Let h be an integer such that 



h > 2- 



q - 1 



Notice that this condition does not depend on m. We suppose that m > h. 
We take x = h/m, and we denote by N(h) the associated integer. We have 
then 



^N(h)-lf]]_\ ^ n N>h)-lh 



Vm/ Z777 



Thus iV(/i) satisfies 



JV(fc) < 1 + — In 2n 



In a h 
and we have 



P(Zx > 0, . . . , Z N {h)-i > 0, Z N(h) > mr]\Z Q = h) 



> P{Z 1 > m{H m f (-) , . . . , Z N(h) > m(H m ) N W (_) | z = fc) 
> P{Zi >(G m )\h),..., Z N(h) > (G m ) N ^(h) \Z = h) 



1 

> 



(777 + l) 2iV W 
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We control next the probability to go from 1 to h. We have 

p(z 1 >h\z = i) > M f (m ~ ir ~ h M g (o, oy 



h J (a - 1 + m y 

In this regime, where h is fixed and m is large, the Binomial law involved 
in the replication mechanism can be approximated by a Poisson law of 
parameter er, whence, for m large enough, 

P(Z 1 > h\Z = l) > i cxp(-a)^- exp(-a/i) . 

We control finally the probability to go from ijin to the neighborhood of p* . 
We do this by following the iterates of F starting from 77, and by controlling 
the error term with respect to the iterates of H m . 

Lemma 7.9 We suppose that ae~ a > 1. For any m > 1, n > 0, x G [0, 1], 
we have 

1 (aer a ) n+1 
m ae — 1 

Proof. We have 

Vx e [0,1] \f'{x)\ < <j e - a , 

and, for any n > 0, 

(i/ m )" +1 (x) = H m ((H m ) n (x)) = F((H m ) n (x))--. 

m 

We shall prove the following inequality by induction on n: 

A ™ 

(^n)"^) > F n (x) _^(ae" Q ) fc . 

fc=i 

The inequality is true for n = 0, 1. Suppose that the inequality holds for 
some n > 0. Since F is non-decreasing, we deduce from the inequality on 
F' and the mean value theorem that 



1 1 

fc=l 

1 " 1 

> F" +1 (a-) - ae""- V (ae- )* 

m — ' ra 

1 

, n+l 

^ F " +1 (-)--E(- e_a ) A 



m 
fc=i 



fc=i 
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and the inequality is proved at rank n + 1. Summing the geometric series, 
we obtain the inequality stated in the lemma. □ 

Let 6 > 0. Now we have 

lim F n ( v ) = p* , 

n— >oo 

therefore 

3t > 1 F*(t?) > p* -S. 
For m large enough, we have 

m ere — 1 

so that, by lemma [731 

(fl^fa) > p*-S. 

Let i be an integer strictly larger than r\m. We have 

P(Zi > 0, . . . , Z t _i > 0, Z t > m(p* -S)\Z =i) 

> P{Zi > m{H m f{ri), ...,Z t > m(H m )\ri) \Z = i) 

> P{Z 1 > m(H m f (-),..., Z t > m(H m y (-) \Z = i) 

> P{Z l >{G m ) 1 (i) 1 ... 1 Z t >{G m )\i)\Z = i) > 1 

To conclude, we use the monotonicity of (Z n ) n >o and we combine the three 
previous estimates. The values h, t do not depend on m, and there exists 
a positive constant c depending on rj, h such that, for m large enough, 

N(h) +t + l < clnm, 



(^2 exp(-cr) — exp(-ah) 



clnm 

> 



(m + l)2A r ('i)+2t - m cln, 



Let us set 

s = |_clnraj - (N(h) +t) 
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We have 

P{Zi > 0, . . . ,Z [cAnmi _ 1 > Q,Z [clnmi > m(p* -8)\Z = 1) > 
^2 X] P ( Zl - h > ■ ■ ■ ' Zs ~ 1 ^ h > z s = i> z s+i > 0, . . . , 2 , s +AT(h)-i > 0, 

j>fa i>mrj 

Z s+N(h) = h Z s+N(h)+1 > 0, . . . , Z s+iV (fc)+i-l > 0, 

^ s +JV(h)+« > m (p* -S)\Z Q = 1) 
>J2/Z P(Zi>h,...,Z a _ 1 >h,Z s =j\Z = l)x 

j>h i>mr) 

P{Z s+ i > 0, . . . , Z s+N ( h )_ 1 > 0, Z s+N ( h ) =i\Z s = j)P{Z s+N ( h ) + i > 0, 
. . . , Z s+N (h)+t-i > 0, Z s+N (h)+t > m(p* — S) | Z a+ jv(fc) = «) 

> > fc, . . . , Z s > h I Z = 1) P ( Zl > °> ' • • ' z *(/0-i > °. 



J N(h) 



Z = h)P{Z x > 0, . . . , Z t -x > 0, Z t > m(p* - S) \Z =i) 



> 



(p{Zx >/i|Z = l)) S x 



P(Zi > 0, . . . , Zjv(fc)_i > 0, Zjv(/i) > mr)\Z = h) 



exp ( _cr )^T ex P(~ a/l ) 



(to + l) 2t 

1 



This is the required lower bound. □ 

Whenever the starting point is away from 0, the estimate of lemma [7791 can 
be considerably enhanced, as shown in the next lemma. 

Lemma 7.10 We suppose that cre~ a > 1. For any 8 > 0, there exist h > 1 
and c > 0, depending on 6, such that 

P(Zi > 0, . . . , Z h _! > 0, Z ft > m(p* -8)\Z = [mS\) > 1 - exp(-cTO) . 
Proof. Let S > 0. Since 

lim F n (S) = p* , 

n— >oo 

then there exists h > 1 such that F h (8) > p* — 8. By continuity of the map 
P, there exist poi Pii ■ ■ • , Ph > such that po — 6 , ph > p* — 5 and 

Vfte {!,...,&} F(p fc _i) > p fc . 
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Now, 

P(Z 1 > 0, . . . , Z h _ x > 0, Z h > m(p* -S)\Z Q = [m5]) > 

p(Vk G { 1, . . . , h } Z k > mpk | Z = [mS\ ) 
Passing to the complementary event, we have 

P(3fce{l,...,ft-1} Z k = or Z h <m(p* -S)\Z = [m6\) 
< P(3ke{l,...,h} Z k <mp k \Zo= [m6\^ 

- X] P { Zl - mf>1 ' • ■ • ' Zk ~ 1 - m Pk-l, z k < mp k | Z = |m£|) 



Kk<h 



< ^2 ^2 P(Zk-i =i, Z k < mpk | Zq = [m5\ 

l<k<h i>mph-i 



< /2 /Z P(Zk<mPk\Z k -i = i)p(z k - 1 =i\Z = Lm5j) 

< /2 p [ z i < m />* I = L m Pfc-iJ 



l<k<h 

The large deviation principle for the transition probabilities of the Markov 
chain (Z n ) n >o stated in proposition 17.31 implies that for k G { 1, . . . , h }, 

limsup — \nP(z 1 < mp k | Z = [mpk-i] 

£,m—>oo,q—tQ m ^ 
Iq— >a 

< -inf {l(p fc _ llS ,t) :s£ [0,l],t<p fc } < 0. 
Since h is fixed, we conclude that 

limsup I lnP (3fcG{l,...,ft-l} Z fe = 

^-►oo.g-M) m V or z h<m{p*-5) 

lq-^ta 

and this yields the desired estimate. □ 

With the estimate of lemma IT. 1Q[ we show that the process is very unlikely 
to stay a long time in [md, m(p* — 5)]. 

Corollary 7.11 We suppose that oe~ a > 1. Let 5 > 0. There exist h > 1 
and c > such that 



Z Q = [m,S] < 



Vfc G [m8, m(p* - 5)] Vn > 

n 
-h 



P[m5 < Z t < m(p* - S) for < t < n\ Z = k) < cxp ( - 



cm 
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Proof. Let k £ [mS, m(p* — 6)]. Let 5 > and let h > 1 and c > be 
associated to S as in lemma 17.101 We divide the interval { 0, . . . , n } into 
sub-intervals of length h and we use repeatedly the estimate of lemma [7~TU1 
Let i > 0. We write 

P(mS <Z t < m(p* - S) for < t < (i + l)h \Z = k) = 

P(mS < Z t < m(p*-S) for < t < (i + l)h, Z ih =j\Z = k) 

Sm<.j<.(p*—5)rn 

= Y p { mS < z t< m(p* - 5) for < t < ih, Z ih =j\Z = k) 

Sm<j<(p*—5)m 

x P(mS < Z t < m{p* - 5) for ih < t < (i + l)h \ Z ih = j) 
< ^2 P(mS < Z t < m(p* - 5) for < t < ih, Z lh = j \ Z Q = k) 

5m<j<.(p*—5)m 

x P(Z h <m{p* -S)\Z = [mS]) 

< P(mS < Z t < m(p* -5)ioi0<t<ih\Z Q = k) exp(-cm) . 
Iterating this inequality, we obtain 

Vi > P(m8 < Z t < m(p* - 5) for < t < ih \Z = k) < cxp(-cmi) . 

The claim of the corollary follows by applying this inequality with i equal 
to the integer part of n/h. □ 

We have computed the relevant estimates to reach the neighborhood of p* . 
Our next goal is to study the hitting time To starting from a neighborhood 
of p* . Since we need only a lower bound, we shall study the hitting time 
of a neighborhood of 0. For 6 > 0, we define 

Ts = inf { n > : Z n < mS } . 

Let i > (p* — S)m. We shall estimate the expectation of Tg starting from i. 
The strategy consists in looking at the portion of the trajectory starting at 
the last visit to the neighborhood of p* before reaching the neighborhood 
of 0. Accordingly, we define 

S = max { n < ts : Z n > (p* — 8)m } . 

We write, for n, k > 1, 

P(t S <n\Z Q = i) = P(rs=t,S = s\Z = i) 

l<s<t<n 

= }Z P(rs=t,S = s\Z =i)+ P(r s =t,S = s\Z = i). 

l<s<t<n l<s<n 
s<t<s+fe s-\-k<t<n 
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Let h > 1 and c > be associated to 8 as in corollary 17. Ill For 1 < s < n 
and i > s + fc, 



P(r« = t,5 = «|2b=i) 

= ^2 P(t s =t, S = s, Z s+ i =j\Z = i 

mS<j<~(p^—S)m 

f5m < Z r < (p* ~ S)m 
iors + l<r<t-l 



m5<j<(p* — <5)m 



< to cxp I — cm 



Zs+l = j 

t-s-2 



whence 



t- 1 



P(t^ = t, 5 = s I Zq = i ) < n to exp | 

l<s<n t>k 
s-\-k<t<n 

For 1 < s < t < n and t < s + k. 

P{r s = t, S = s\Z = i) 

< J2 P{rs=t,S = s,Z a =j\Z = i) 
j>(p*-S)m 

< p ( z t < Sm \ z s = j) 

j>(p*—5)m 

< mP{Z t _ s < Sm | Z = l(p* - S)m\ ) 

whence 

P(r s = t, S = s\Z =i) < 

l<s<n 
s<t<s+k 



Y mP(Z t <8m\Z = [(p* - 5)m\ ) 



i<t<fc 

Putting together the previous inequalities, we obtain 

*- 1 

<n exp I — cm 

t>k 



P( T 5 < n ■ | -^0 — £) < n ^2 111 CX P ( 



'-I 



Y mP(Z t <5m\Z = [(p* - 5)m\ ) 



Kt<k 
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We choose k large enough so that 

limsup — In ( ~S~^ m exp ( — cm — \ ) <—V(p*—5,5). 



',m— foo, q—yO 
lq-^ta 



and we use the large deviation principle stated in corollary [73] to estimate 
the second sum: 

limsup — In | y^ mP(Z t < 5m \ Zq = [(p* — 8)m\ 



iq—ya 



Kt<k 



< - min V t (p*-5,5) < -V(p*-S,S). 

l<t<k 

Applying the previous inequalities with n = exp(mV^(p* — 5,5) — n%5), we 
conclude that 

lim P(t s < exp(mV{p* -5,5)- mS) \Z = i) = 

£,m— >-oo, q— ^0 
Iq— >a 

and therefore 

lim inf — In E(t s I Z = i) > V(p* -5,5)- 5. 

I ,m— >ao, q— >0 m 
lq-^a 

To derive a lower bound on the expectation of tq starting from 1, we com- 
bine the previous estimates as follows. By lemma VTE[ asymptotically, 

P(Z 1 > 0, . . . , Z|_clnmJ-l > 0, ^[clrimj > ~ 5) \ Z = l) > clpTO ■ 

Thus, letting i = \_{p* — 5)m\ + 1, for any n > [clnmj, 
P(r > n | Z = 1) > 

^ f(Zi > 0, . . . , ^[clnmj-l > 0, ZjclnrnJ = j, Tq > 71 \ Zq = l) 

j>i 

> y^-P C^l > 0, . . . , Z|_ clnm j_ a > 0, Z|_ clnm j = j I Z Q = l) 

x P (-^LclnrnJ+l > 0, . . . , Z n > | Z\_ c \ nm j = j) 

> P(Z 1 > 0, . . . , Z|_ clnTO j_! > 0, Z|ci nm j > m(/9* - <5) | Z = 1) 

x P(ZL c i nm j + i > 0, . . . ,Z n > 1 Z|_ clnm j = i) 

^ r In r» P ( T > U ~ L clllm J I ^0 = ») • 
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Summing from n 



[clnmj to +00, we get 



E(t \Z = 1) > 



1 



E{t \Z = i). 



m 



c In m 



The very definition of t$ implies that tq > t$, whence 



E(t I Z = i) > E{t s \Z = i). 



From the lower bound on rg and the previous inequalities, we deduce that 



7.3 Concentration near p* 

In this section, we estimate the numerator of the last formula of section [rI751 
As usual, we drop the superscript 9 from the notation when it is not neces- 
sary, and we put it back when we need to emphasize the differences between 
the cases 9 = 1 and 9=1. Wc define, as before proposition 17.61 



Let / : [0, 1] — > M be a non-decreasing continuous function such that 
/(0) = 0. Our goal here is to estimate the expected value of the sum 



The Markov chain (Z„)„>o is a perturbation of the dynamical system as- 
sociated to the map F, therefore it spends most of its time in the neighbor- 
hood of the stable fixed point p* . On very large time intervals, the process 
visits points far away from p* and then it returns quickly to p* . Thus the 
fraction of the time spent away from p* is negligible. We will show that 
the above sum is, on average, comparable to f{p*)TQ. 

Proposition 7.12 Uniformly over i S { 1, . . . , m }, we have 



£,m— >oo, q— s-0 m 



liminf — 1uE(t \ Z = 1) > V{p* -5,6) -5. 



The conclusion follows by letting 6 go to 0. 



□ 





lim 

t,m— ¥00 
q— >0, Iq— >a 
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Proof. Before proceeding to the proof, let us explain the general strategy, 
which comes directly from the theory of Freidlin and Wentzell. Let us 
denote by U(S) the ^-neighborhood of p* . We choose 5 small enough, so 
that when the process is in the neighborhood U(25), the value f(Z n /m) 
is approximated by f(p*). When the process is outside of { 0} U U(2S), it 
reenters U(6) in [clnmj steps with probability at least m _clnm , for some 
c > (see lemma 17. 15ft . Therefore the average length of an excursion is 
bounded by m clnm . At a given time, the probability to start an excursion 
from U(5) reaching the outside of U(2S) is less than exp(— cm). With this 
estimate we can control the number of excursions (see lemma 17. 14ft and 
we show that, typically, their total length until the time tq is negligible 
compared to tq. 

Let e > 0. Since / is continuous, there exists 6 > such that 

W pe }p*-2S,p* + 2S[ \.f(p)-f(p*)\ < £■ 

We define then a sequence of stopping times to follow the excursions of 
(Z n ) n >o outside ]p* — S,p* + S[. More precisely, we set T = and 

T* = inf { n > : — e }p* - 6, p* + S[ } , 
m 

T x = inf { n > T* : — <£}p* - 25, p* + 2S[ } , 



T* = inf { n > T fc _! : ^ e }p* - 6, p* + S[ } 
m 



inf { n > Tt : — ^}p* - 25, p* + 2S[ } 
1 m 



We decompose the sum over the intervals [Tk-i,T%[, [T£,T k [, k > 1. De- 
noting by s A t the minimum min(s, t), we have 

TfAT-o-1 / \ T k Aro-l / \ 

E E /(^ -«"") +E E / § -/(/) ■ 



V m 1 ) r— ' -4^ \ V m 

fe>ln=T fc _ 1 AT \ / fc>ln=T*AT \ 

We bound next the absolute value as follows: 



E/(=)-w 



n=0 



< 2/(1) ^ (t,! A r - T fc _x A T ) + £ r . 



fc>i 
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It remains to deal with the sum. We define, for n > 0, 
K(n) = max { k > 1 : Tk-i < n } , 
and the sum becomes 

K(t ) 

J2 (Tk A ro - T fe _! A to) = 2 (T fe * A ro - T fe _! 
fc>i fc=i 

Let r) > 0. We set 

C = fflcp(m(y(p*,0) + 77)). 
We decompose the sum as follows: 

K(t ) K(t ) 

{rt A r - Tfe.iJ < r l ro>t ; :i + l T0 <e 51 ( T fc A r ° ~ Tfc -! 
fc=i fc=i 

We suppose that the process starts from i E { 1, . . .,m}. The estimates 
are carried out exactly in the same way for any value of i, therefore, to 
alleviate the notation, we remove the starting point from the notation. 
Throughout the proof the expectation E and the probability P are meant 
with respect to the initial condition Zq = i. Taking expectation in the 
previous inequalities, we get 



> n=0 



/ K(t ) 

< eE(T )+2f(l)E(r l To>t .J+2f(l)E l To < c £ (t* A r - T fc _i) 

\ fc=i 

Next, we take care of the second term. 
Lemma 7.13 For any N, j > 1, 

-^(^0-1 ro>A r j 

n>N 

Proof. We compute 

E(r l To>Nj ) = kP(ro = k) = J2 J2 W-P(r = fc) 

k>Nj k>Nj n>0 

= £ E P ( To = k ) = E P ( T " > max(iVi,n)) 

n>0 Jc>iVj n>0 

< NjP(r > Nj) + J2 P{ro>n). 

n>Nj 



r/2 



Next, 

j-i 

P{ro > n) = P ( T ° > n J + k ) < E -? P ( T ° > n J') 

n>JVj n>JV fc=0 n>JV 

and we have the desired inequality. □ 
We apply lemma [7771 with e = 77/8: there exists j > 1 such that 

Vn > P(r > nj | Z Q = m) < (l - cxp ( - mV(p* , 0) - mr)/2)j . 
Wc apply lemma 17.131 with this j and 

N = LC/J'J = [iexp(mV{p*,0) + mri)\ 
and we use the previous inequality: 

E(r l To>t n m ) < E(T Q l T0>Nj ) < NjP( TQ > Nj) + J2 j p ( T o > nj) 

n>N 

fotj + j exp (mV(p*,0) + mr)/2)) (l - cxp ( - mV(p*, 0) - m-q/2)^ 



< 



< 



(1 + j) exp [mV(p* , 0) +mrj) exp — iVexp ( — mV(p* , 0) — mq/2) 



Thanks to the choice of N, this last quantity goes to as m goes to oo. 
We deal now with the last sum in the inequality before lemma 17.131 Wc 
give first an upper bound on K. 

Lemma 7.14 There exists c > 0, depending on S, such that, asymptoti- 
cally, 

Vfc, n > P(K(n) > k) < — cxp(-cmfc) . 

Proof. We denote by U(6) the ^-neighborhood of p*\ 

U(S) =}p*-6,p* + 6[. 

For k > 0, we define 

St = sup {T k < n < T k : — e U(S)} . 

m 

For k, n > 0, we have 

P(K(n) >k) = P{T k < n) = ]T P(T* =t*,S k = s, T k = t) . 

t*<s<t<n 
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Let h > 1 and c > be associated to 6 as in corollary |7.11l We can suppose 
that h > 2. For given values of t* and s, we split the sum over t in two 
parts: 



p{n = t*,s k = s ,T k = t) = 



E 



t:s<t<n 



t:t>s+h 



t:s<t<s+h 



We study next the first sum, when t > s + h. We condition on the state at 
time s + 1: 



E • 

t:t>s+h 



p{n = t* , s k = s , z s+1 = j, r k = t) 



t:t>s+h 
j£mU(25)\mU(8) 



TZ=t*,Z s+1 =j,Z t £mU(26) 



t:t>s+h 
jemU(2S)\mU(S) 



Epl k 
[Z s+1 ,...,Z t -i€mU(28)\mU{6)J 

P(Zs+i, • • • , Z t -i € mU(25) \ mU(S), Z t # mU(25) \ Z s+1 = j) 

xP(T k * = t*,S k = s,Z s+1 =j). 



t:t>s+h 
j£mU(2S)\mU(S) 



For < s < n and t > s + h, 



P(Z S+1 , Z t _! e mU(25) \ mU{5), Z t mU(26) \ Z s+1 = j) 

f5m < Z r < (p* — 8)m 
for s + 1 < r < t - 1 



< P 



Zs+1 = j 

< exp I — cm 



t-s-2 



\) 



Thus 



E ■ 

t:t>s+h 



< 



E cx p 

\ t>h 



t- 1 



\P(T!=t*S k = s) 



Let us focus on the second sum. We condition on the state at time s: 
E •••= E P(T k * = t*,S k = s,Z s =j,T k =t) 



t:s<t<s+h 



t:s<t<s+h 
jemU(S) 



< P{Zt£mU(26)\Z s =j)P(T k * = t*,S k = s,Z s =j) 



t:s<t<s+h 



< ]T P(Zt£mU(26)\Z =j)P(T k * = t*,S k = s,Z s 



t:l<t<h 
j€mU(S) 
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For any j € mU(5), using the monotonicity of (Z n ) n >o, 

P(Z t £ mU(25) | Z = 3) < P{Z t < m{p* - 25) \ Z Q = j) 

+ P(Z t > m(p* + 25) I Z Q = 3) 
< P(Z t < m{p* - 25) I Z = l(p* - 5)m\) 

+ P(Z t > m(p* + 25) j Z = [(p* + 5)m\) . 



We use the large deviation principle stated in corollary 17.41 to estimate the 
last two terms. For any t £ { 1, . . . , h }, 

limsup —\nP(Z t < m(p* - 25) \ Z = [{p* - 5)m\) 

t,m— >oo. q— >0 

lq ^ a < - inf { V t {p* - 5, p) : p < p* - 25) } , 

limsup — lnP(Z t > m(p* + 25) \ Z = [(p* + 5)m\) 

l,m— >oo. q— >0 m 

iq ^ a < -inf {V t (p* + 6,p) :p > p* +25)}. 

By compactness, the infima are realized. Because of the constraints on p, 
the point p realizing the infimum 

inf {V t (p* -5 lP ):p<p* -25)} 

is not an iterate of p* — 5 through F, hence by lemma [775l the above infimum 
is positive. We argue in the same way for the second infimum and we 
conclude that there exists c' > 0, depending on 5, such that, asymptotically, 

Vj € mU{5) P ( Z t & mU(25) \ Z = j) < exp(-c'm) , 

t:l<t<h 

whence 

53 ■ • • < exp(-c'm) P(T* k =t*,S k = s) . 

t:s<t<s+h 

Let c" > be such that, asymptotically, 

53 ex P f ~ cm — t — + exp(— c'm) < cxp(— c"m) . 
t>h 

Reporting in the initial equality, we obtain that, asymptotically, for any 
n, k > 0, 

P(T k < n) < ^ exp(-c"m)P(r fe * = t* , S k = s) 

t*<s<n 

< 53 P ( T k = **) exp(-c"m) < 53 P(Tk-i < t*) exp(-c"m) . 

t * < n t * < n 
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Iterating this inequality, we obtain 

P(T k < n) < exp(-c"mfc) , 

0<no<"-<nfe-i<n 

whence 

P(T k <n) < — exp(-c"mfc) 
k\ 

as required. □ 
We estimate now the last sum. By the Cauchy-Schwarz inequality, we have 

/ K(t ) \ 

fe>i v / 

< ^P(r <C, K(r ) > k) 1/2 ( s( W (ro) (T* A r - T k ^f) J 

fc>i V / 



1/2 



If 1 < fc < -RT(to), then Tfc„i < To and Zx k _ 1 > 0, so that, using the Markov 
property, 



E E(l k < KiTo) (T* k Ar Q -T k ^] 



Z Tk _ t =j)P(Z Tk _ 1 =j) 

l<j<m 

< ]T S((T*Ar ) 2 |Z =j)P(^T fc _ 1 =i). 

l<j<m 

We will next bound the time T-j* A To, starting from j 6 {l,...,m}. 

Lemma 7.15 We suppose that <re~ a > 1. For any 5 > 0, there exist 
mo, c > 0, depending on 5, such that, for m > mo, for j G { 1, . . . , m }, 

P(m(p* - S) < Z [clnmi < m(p* +S)\Z = j) > c ^ nw . 

Proof. Using lemma [7781 there exists c > such that, for m large enough, 

P(Z lclnml < m(p* - 6) | Z = 1) < 1 - — ^ . 



• r j(i 



Proceeding as in lemma [7. 101 we obtain that there exist h, d > such that 
P(Zh > m(p* + 5) | Zq = m) < exp(— dm) . 

We have then 

P{Zy c \nm\ > m (p* + 6) | Z Q = m) = 

^ f(-^LcinmJ > m iP* + s ), Zy c i nm \- h = j\Z = m) 

je{ l,...,m} 

= /\2 P(Zh>m(p*+5)\Z =j)P(Zi clnmi _ h =j\Z = m) 
je{ i,...,m} 

< exp(— dm) . 

Using the monotonicity of (Z n ) n >0i we have then 

P(Zlclnrn\ &]™,(p* - S),m(p* + S)[\ Z = j) 

< P{Zy c \ nm \ < m (p* -6)\Z = j) + P{Zy c y nm \ > m(p* +S)\Z = j) 

< P{Zici nm \ < m(p* -5)\Z = l) + P(^Lcinmj > m(p* + S)\Z Q = m) 



^ 1 - ^hT^T + exp(-c'm) . 
This estimate is uniform with respect to j G { 1, . . . , m }. 



□ 



Corollary 7.16 Wc suppose that ae a > 1. For any S > 0, there exist 
mo, c > 0, depending on S, such that, for m > mo, for j G { 1, . . . , m }, 



Vn > PiTl A t > n[clnm\ \Z =j) < ( 1 



1 \» 



Proof. We proceed as in corollary 17.111 to obtain this inequality. We 
divide the interval {0, . . . ,n[c\nm\ } into subintervals of length [clnroj 
and we use repeatedly the estimate of lemma 17.151 □ 



By corollary 17. 161 we have, for any j G { 1, . . . , m }, 

E((Tl A r ) 2 I Zo = j) = F ( T i A T > \fk | Zq = j) 

VI 



k>l 



< 



k>\ 



Let us set 



a = 1 



m' 



: In J 



[c In mj 



£ = [c In mj 
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We have 



t 2 f°° 

dx = —. nj / e^dy, 

a{hxay J 



k>l k>l 

therefore, asymptotically, for any j, 

E((T* Ar ) 2 \Z =j) <m 3clnm . 
Reporting in the previous inequalities, we have 

Vk > 1 E(l k < K{To) (T* k A to - T k ^f) < m 3cWm 
and, using the estimate of lemma 17.141 

/ K(t ) \ 

V fc=l / fc>0 



< m 



3 c 1 



nm (C exp(-c'm/3) + £ (&- exp(-c'mfc) 

fc>im exp( — c'm/3) 

— — c m— 
2 3 



1/2 



< m 3clnm (tl exp(-c'm/3) + ^ cxp f- 



fe>i 



To get the last inequality, we have used that fc! > (k/e) k , whence, for 
k > t!/ n exp(— c'm/3), 

(fV ) k /P fV \k 

< \^y) < exp(/fc + c'mfc/3). 
We choose rj such that r\ < c'/3. The above inequality implies that 



1 / K(to) 

limsup -ln£ l T0 < t , £ 

re— s.oo,<j— s-0 '"- \ fc— i 



fcAro-Tfe-xj < V(p*,0)+r,--. 



All these estimates, together with proposition 17.61 imply that, asymptoti- 
cally, uniformly with respect tois{l,...,m}, 



E\ 



* n=0 



2 /(^) Z =i\- /(/>*) £(r I Zo = i) < 3 eJ B(r | Z = i) • 



This achieves the proof of proposition 17.121 



□ 
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8 The neutral phase 



We denote by AT the set of the populations which do not contain the master 
sequence w* , i.e., 

A £ \{w*}) . 

Since we deal with the sharp peak landscape, the transition mechanism of 
the process restricted to the set N is neutral. We consider a Wright-Fisher 
process (X n ) n >o starting from a population of M . We wish to evaluate the 
first time when a master sequence appears in the population: 

r» = inf { n > : X n <£ N } . 

We call the time the discovery time. Until the time r», the process 
evolves in N and the dynamics of the Wright-Fisher model in N does not 
depend on a. In particular, the law of the discovery time r* is the same for 
the Wright -Fisher model with a > 1 and the neutral Wright-Fisher model 
with cr = l. Therefore, we compute the estimates for the latter model. 
Neutral hypothesis. Throughout this section, we suppose that er = 1. 

8.1 Ancestral lines 

It is a classical fact that neutral evolutionary processes are much easier to 
analyze than evolutionary processes with selection. The main reason is that 
the mutation mechanism and the sampling mechanism can be decoupled. 
For instance, it is possible to compute explicitly the law of a chromosome 
in the population at time n. Let /iq be an exchangeable probability distri- 
bution on LA) . Let (X„)„>o be the neutral Wright-Fisher process with 
mutation matrix M and initial law fiQ. Let vq be the component marginal 
of /i : 

Vu G A 1 vq{u) = fJ, {{x G (A e ) m : x(l) =u}). 

Let (W n ) n >o be a Markov chain with state space A e , having for transition 
matrix the mutation matrix M and with initial law Uq. 

Proposition 8.1 Let i G { 1, . . . , m }. For any n > 0, the law of the i-th. 
chromosome of X n is equal to the law of W n . 

Proof. We do the proof by induction over n. The result holds for n = 0. 
Suppose that it has been proved until time n. Let i G { 1, . . .,m}. We 
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have, for any u G A , 
P(X n+1 (i) = u) = E P(Xn+i(i) = u,X n = x) 

= J2 P{X n+ i(i)=u\X n = x)P(X n = x). 

Yet we have 

P(X n+1 (i) =u\X n = x) = — Y M(x(j), u) . 

l<j<m 

Thus 

P(X n+1 (i) = u) = E 1 E M{x(j),u)P{X n =x) 

x£(A e )™ l<j<m 

= E 1 E M(v,u)P(X n (j)=v). 
ug^ i<7<m 

By the induction hypothesis, 

V« e ^ Vj e { 1, . . . , m } P(X„(j) = w) = P(VF„ = v) , 
whence 

P(X n+ l(i) =«) = E P ( W n = V)M{V,U) = P{W n+1 = u) . 

The result still holds at time n + 1 . □ 

We perform next a similar computation to obtain the law of an ancestral 
line. Let us first define an ancestral line. For i G { 1, . . . ,m} and n > 1, 
we denote by n,n — 1) the index of the ancestor at time n — 1 of the 
i-th chromosome at time n. More precisely, if the z— th chromosome of the 
population at time n has been obtained by replicating the j-th chromosome 
of the population at time n — 1, then X(i,n,n — 1) = j. For s < n, the 
index n, s) of the ancestor at time s of the i-th chromosome at time n 
is then defined recursively with the help of the following formula: 

T(i, n, s) = T(T{i, n,n — l),n — 1, s) . 

The ancestor at time s of the «-th chromosome at time n is the chromosome 

ancestor(i, n, s) = X s (I(i, n, s)) . 



GO 



The ancestral line of the i-th chromosome at time n is the sequence of its 
ancestors until time 0, 

(ancestor(i, n, s), < s < n) = (X s (I(i, n, s)), < s < n) . 



Proposition 8.2 Let i € {l,...,m}. For any n > 0, the law of the 
ancestral line (ancestor(i, n, s), < s < n) of the i-th chromosome of X n 
is equal to the law of (Wo, . . . , W n ). 

Proof. We do the proof by induction over n. The result is true at rank 
n = 0. Suppose it has been proved until time n. Let i € { 1, . . . , m } and 
let Uq, . . . , u n+ i £ A 1 . We compute 

P(anccstor(i, n + 1, s) = u s , < s < n + l) 



= E E p 

xe(A e ) m l<j<m 

= E E p 



X n +x{i) = u n+1 , l(i,n+ l,n) = j 
X n = x, ancestor(j, n, s) — u s , < s < n 

X n +\(i) = u n+ i ancestor(j, n, s) = u. 
n + 1, n) = j < s < n, X n = x 



x P 

Since we deal with the neutral process, we have 
P 



/ancestor(j, n, s) = u. 
V < s < n, X n = x 



l(i,n + l,n) =j 
= P 



X n +i(i) = u n+ i ancestor(j, n, s) = u 
< s < n, X n = x 

X n+ i(i) = u n+ i 
n + 1, n) = j 

Reporting in the previous inequality, we get 



X n = x ) = —M(x(j), u n+1 ) . 
m 



P(anccstor(i, n + 1, s) = u s , < s < n + l) 



E E ^M(x(j),u n+1 ) P ( 



m 

ce(A e ) m l<j<m 



/ancestor(j, n, s) = u, 
< s < n, X n = x 



u n+ i) P(ancestor(i, n, s) = u s , < s < n) 



m 

l<j<m 



By the induction hypothesis, we have 

P(ancestor(i, n, s) = u s , < s < nj = P(Wq = ito, • • • , W n = u n ) 



Gl 



Therefore 



P(ancestor(i, n + 1, s) = u s , < s < n + l) = 

P(W n+ i = u n+ i | W n = u„)P(W = ito> • • • , W n = u n ) 

= P(Wo =UQ,..., W n +1 = U n +1 



) 



and the induction step is completed. 



□ 



8.2 Mutation dynamics 

We consider a Markov chain (F ra )n>o with state space { 0, . . . , £ } and having 
for transition matrix the lumped mutation matrix My . In this section, we 
recall some properties and estimates concerning the Markov chain (Y n ) n >Q. 
We refer to the corresponding section of [3] for the detailed proofs. The 
Markov chain (Y n ) n >o is monotone. We denote by B the binomial law 
B(£, 1 — 1/k) with parameters £ and 1 — i.e., 



The matrix Mh is reversible with respect to the binomial law B. This bino- 
mial law is the invariant probability measure of the Markov chain (Y n ) n >Q. 
When £ grows, the law B concentrates exponentially fast in a neighbor- 
hood of its mean £ K = i(l — 1/k). We restate next without proofs several 
inequalities and estimates proved in [3]. 

Lemma 8.3 For any b £ { 0, . . . , m } and n > 0, we have 



For b £ { 0, . . . , £ }, we define the hitting time rib) of { b, ...,£} by 
r(b) = inf { n > : Y n > b } . 






Lemma 8.4 For b < £/2, we have 
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Lemma 8.5 For n such that 

2?) 



n > 



P (4 - b) ' 
wc have 

4/ 

P(r(b)>n\Y o = 0) < . 

np (l K - b) 

Proposition 8.6 Wc suppose that i — > +oo, q — > 0, tq — > a <G]0,+oo[. 
Asymptotically, we have 

Proposition 8.7 We suppose that t — > +oo , q — > Q,lq — > a g]0, +oo[. 
Asymptotically, we have 

Vn > y/i P(Y n > \n£\Y = 0) > 1 - 



Proposition 8.8 Wc suppose that t — > +00,9 — > 0,£q — > a G]0,+oo[. 
Let e G]0, l/4[. Asymptotically, we have 

Vn>— P(F n >4(1-4 £ )|F = 0) > 

We define 

r = inf { n > : Y n = } . 



Proposition 8.9 For any a G]0,+oo[, 

lim sup i In E(t \Yq = i) < In k . 

I— >oo,q— >0 £ 
£q^a 

8.3 Discovery time 

The dynamics of the processes (Of l ) n >Q, (O^) n >o in M are the same as 
the original process (O n )n>0; therefore we can use the original process to 
compute their discovery times. Letting 

r*' £ = inf { n > : 0* e W* } , r*' 1 = inf { n > : G W* } , 
r* = inf { n > : O n G W* } , 
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we have 

E(r*> e | O e = oLit) - E{r* \ O = (0, 0, 0, . . . , to)) , 
E(r* A | Ol = ol^ = E(t* I O = (0, m, 0, . . . , 0)) . 

In addition, the law of the discovery time r* is the same for the distance 
process and the occupancy process. With a slight abuse of notation, we let 

t* = inf { n > : D n e W* } . 

Notation. For b E { 0, ...,£}, we denote by (6) m the vector column whose 
components are all equal to b: 



(b) m = 



We have 

E(t* I O = (0, 0, . . . , 0, to)) = E(t* I Do - W m ) , 
E(t* I Oo - (0, to, 0, ... , 0)) = £(r* I D = (l) m ) . 

We will carry out the estimates of r* for the distance process (D n ) n >Q. 
Notice that the case a = +oo is not covered by the result of next proposi- 
tion. This case will be handled separately, with the help of the intermediate 
inequality of corollary 18. Ill 

Proposition 8.10 Let a e]0, +oo[ and a e [0, +oo[. For any d e Af, 
lim \ ]hE(t* \D = d) = In k . 



Proof. Since we are in the neutral case a = 1, then, by corollary |5.4[ the 
distance process {D n ) n >o is monotone. Therefore, for any d € Af, we have 

E(t* I Do = (l) m ) < E(t* \D = d) < E(t* I Do = (£) m ) . 

As in the section 18.21 we consider a Markov chain (F„)„>o with state 
space {0,...,^} and having for transition matrix the lumped mutation 
matrix Mr- Let us look at the distance process at time n starting from 
(t) m . From proposition [5TTJ we know that the law of the i-th chromosome 
in D n is the same as the law of Y n starting from £. The main difficulty 
is that, because of the replication events, the m chromosomes present at 
time n are not independent, nor are their genealogical lines. However, 
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this dependence does not improve significantly the efficiency of the search 
mechanism, as long as the population is in the neutral space Af. To bound 
the discovery time r* from above, we consider the time needed for a single 
chromosome to discover the master sequence if* , that is 

T = inf { n > : Y n = } 

and we remark that, if the master sequence has not been discovered until 
time n in the distance process, that is, 

Vt<n Vi G {l,...,77i} D t (i)>l, 

then certainly the ancestral line of any chromosome present at time n does 
not contain the master sequence. By proposition 18. 2[ the ancestral line of 
any chromosome present at time n has the same law as Yq , . . . , Y n . From 
the previous observations, we conclude that 

Vn > P(t* >n\D = (£) m ) < m P(r > n \ Y = I) . 

Summing this inequality over n > 0, we obtain the following upper bound 
on the discovery time. 

Corollary 8.11 Let To be the hitting time of for the process (Y n ) n >o. 
For any d £ Af, any m > 1, we have 

E(t*\D = cL) < mE( To \Y = e). 

With the help of proposition ^. 9[ we conclude that 

lim sup — In E(t* \ Dq = ct) < ln«. 

£.m— >oo, q— fO *~ 

The harder part is to bound the discovery time t* from below. The main 
difficulty to obtain the adequate lower bound on t* is that the process starts 
very close to the master sequence, hence the probability of creating quickly 
a master sequence is not very small. Our strategy consists in exhibiting a 
scenario in which the whole population is driven into a neighborhood of the 
equilibrium i K . Once the whole population is close to £ K , the probability to 
create a master sequence in a short time is of order thus it requires 

a time of order n l . The key point is to design a scenario whose probability 
is much larger than Indeed, the mean discovery time is bounded 

from below by the probability of the scenario multiplied by n l . We rely on 
the following scenario. First we ensure that until time £ 3 ^ 4 , no mutation 
can recreate the master sequence. This implies that t* > £ 3 ^ 4 . Let us 
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then look at the population at time £ 3 ^ A . Each chromosome present at this 
time has undergone an evolution whose law is the same as the mutation 
dynamics studied in section 15721 The initial drift of the mutation dynamics 
is quite violent, therefore at time f 3 / 4 , it is very unlikely that a chromosome 
evolving with the mutation dynamics is still in { 0, ■ • • , \n£ }. The problem 
is that the chromosomes are not independent. We take care of this problem 
with the help of correlation inequalities. Thus, at time £ 3//4 , in this scenario, 
all the chromosomes of the population are at distance larger than In I from 
the master sequence. We wait next until time £ 2 . Because of the mutation 
drift, a chromosome starting at ln^ has a very low probability of hitting 
before time £ 2 . Thus the process is very unlikely to discover the master 
sequence before time £ 2 . Arguing again as above, we obtain that, for any 
e > 0, at time £ 2 , it is very unlikely that a chromosome evolving with the 
mutation dynamics is still in { 0, ■ • • ,£ K (1 — s) }. Thus, according to this 
scenario, we have r* > £ 2 and 

Vie {l,...,m} D P {i) > 4(1 -e). 

Let us precise next the scenario and the corresponding estimates. We 
suppose that the distance process starts from (l) m and we will estimate 
the probability of a specific scenario leading to a discovery time close to n e . 
Let £ be the event 

£ = {Vn<£ 3/i Vie{l,...,m} U^ 1 > p/n } . 

If the event £ occurs, then, until time ^ 3 / 4 , none of the mutation events in 
the process (-D n )n>o can create a master sequence. Indeed, on £ , 

Vb€{l,...,£} Vn<£ 3/4 ViG{l,...,m} 

M H (b, Uti\..., U?) > M H (1, l%\ . . . , U 1 /) 

e 

> 1 + lrr*.^i 1 1 i / i > 1- 

— / j Un >1— p(l— 1/k) — 

1=2 

Thus, on the event £ , we have r* > £ 3 / 4 . The probability of £ is 

P{£) = (1 



H / 

Let e > 0. We suppose that the process starts from (l) m and we estimate 
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the probability 

P(r* >^ 1 - £ )) > P(t* >K ti - 1 ~ e \£) 

> P Nn 6 { £ 3/ \ . . . , k^ 1 -^ } D n G TV, f ) 
= ^ P(Vn € { £ 3 / 4 , . . . , k^ 1 " 8 ) } D„ e AT, D P /4 = d, £} 

> p ( Vn e { ^ 3/4 ' ■ • ■ ' k£(1_£) > i? « e ^ i D ^ ii = d ' f ) 

d>(lnf) m 

x P(L» £3/ 4 =d,S). 

Using the Markov property, we have 

P (Wn e { £ 3/4 , . . . , ^ (1 " e) } £>„ e N | L>£3/4 = d, 

= p(Vn e { 0, . . . , ^ (1 " e) - £ 3/4 } D n eJV\D Q = d) 

= p(r* > w'C 1 -) - £ 3 / 4 I £) = d) > p(t* > k^ 1 "^ | D = d) . 

In the neutral case, by corollary 15 .41 the distance process is monotone. 
Therefore, for d> (lnfp, 

P(t* > K^ 1 '^ \D = d) > P(t* > K l{l ~ e) I D Q = (ln£) m ) . 

Reporting in the previous sum, we get 

P{t* > K l{1 ~ e) ) > P(t* > n t( - l -^\D Q = (\n£) m ) P(D e3/i > (ln£) m , £) . 

We first study the last term in the above inequality. The status of the 
process at time £ 3 ^ 4 is a function of the random matrices 

R n = (^,^ 1 ,...,^) 1 ^ m , l<n<£ 3 / 4 . 

We make an intermediate conditioning with respect to the variables S l n : 

P{D i3/ 4 > (\n£) m , £) 

= E^P(D e3/i > (ln£) m , £ I S l n , 1 < i < m, 1 < n < £ 3 / 4 )) . 

The variables S % n , 1 < i < to, 1 < n < £ 3 / 4 being fixed, all the indices of 
the chromosomes selected for replication are fixed, and since the mutation 
map Mn(-,Ui, . . . ,Ui) is non-decreasing with respect to m, ■ ■ ■ , ug, we see 



or 



that the state of the process at time 

£3/4 

is a non-decreasing function of 

the variables 

l<i<m, l<n<£ 3 /\ 

Thus the events £ and Dp/i > (ln£) m are both non-decreasing with respect 
to these variables. By the FKG inequality for a product measure (see 
section we have 



P(D g3/ 4 > (lne) m , £\S % n ,l<i<m,l<n< £ 3/4 ) > 

P(D p/ 4 > (\nl) m | S q n , l<i<m,l<n< £ 3/4 ) 

x P(£ | S l n , 1 < i < m, 1 < n < £ 3/4 ) . 

Yet £ does not depend on the variables S l n , therefore 

P(£\Sll<i<m,l<n< £ 3/i ) = P{£) . 

Reporting in the conditioning, we obtain 

P(D PI * > (ln£) m , £) > 

e(p(d p/ 4 > (iney n | si, 1 < i < m, 1 < n < £ 3/i ) P(£ )) 

= P(D e3/4 > {\n£) rn )P(£) . 
By proposition l5.8[ the distance process has positive correlations, therefore 



P{D e s/4 > (ln£) m ) = P(Vie {l,...,m} D p/i {i) > ln£) 

> Y[ P(D e3/i (i) > ln£) . 

l<2<m 

From propositions 18.11 and 18. 71 

Mi G {l,...,m} P(D £3/i (i) > ln£) = P(y £3/ 4 > ln£) > 1 - 

Putting the previous estimates together, we have 

6 \ m ( p\ m£3/1 



P(2W> M »,*)>(l-^) (l-£) 
We study next 

P(r* > K l{l ~ e) | U = (ln^) m ) . 

We give first an estimate showing that a visit to becomes very unlikely 
if the starting point is far from 0. 



G8 



Lemma 8.12 For b € {1, ...,£}, we have 

Vn > P(t* < n \ D = (b) m ) < nm^^ . 

Proof. Let n > and b G { 1, . . . ,£ }. We write 

P{t* <n\D = (b) m ) = 

P(3t<n 3i€{l,...,m} D t (i) = 1 D = (b) m ) 

^ E E P{D t (i)=0\D = (br). 

l<t<n l<i<m 

By proposition 18. 11 for any t > 0, any i G { 1, . . . , m }, 

p(D t (0 = o|A) = (&) m ) = = o i y = 6) . 

Using lemma 18.31 and putting together the- previous inequalities, we get 

P(r*<n\D = (b) m ) < rnn® 

as requested. □ 
Let e' > 0. Now 

p(t* > K^-^lDo = (ln£) m ) 

>(r* > £ 2 , D n e TV for £ 2 < n < k £(1 - £) | D = (lni) m ) 



> P 



E P (n p AA for^ 2 < n < D ° - M? 



deAf 



> P { D n e TV for £ 2 < 71 < K^ 1 "^ | T* > £ 2 , L> £ 2 = d 

d>(^(l-e')) m 

x pfr* > £ 2 , D t 2 = d | D = (ln^) m ) . 

Using the Markov property and the monotonicity of the process (_D n ) n >o, 
we have for d > (4(1 - e')) m , 

p(p n eH for £ 2 < n < k £(1 ~ £) | r* > f% Dp = dj 

= p(Vn G {0, . . . , k^ 1 -^ -t 2 } D n eM\D = d\ 
= P(r*> -£ 2 \D = d) >p(t*> k 1 ^ \D = d) 

> P(r* >K e ^\Do^(£ K (l-e')) m ). 
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Reporting in the previous sum, we get 

p(t* > K^ 1 '^ | D = (ln<O m ) > P{t* > n !(1 - e) | D = (4,(1 - e')) m ) 
x p(t* > £ 2 , D t 2 > (4(1 - e')) m | D = (ln^r) . 
We first take care of the last probability. We write 

P(r* > £ 2 , Dp > (4(1 - e')) m | Do = (ln£) m ) > 

p(d p > (4(1 - e')) m | Do = (ln£) m ) - p(r* < £ 2 \ D = (ln^) m ) . 

To control the last term, we use the inequality of lemma 18.121 with n = £ 2 
and b = In t: 

P(T*<e\Do = (Xnl)™)<e m ^ y 
By lemma [8^4} we have 

g(0) /21niy°< 



/21n4 ln£ 



S(lnf) 

whence 

P(r* <e 2 \D = (lnl) m ) < ^ 

For the other term, we use the monotonicity of the process (D n ) n >o an d 
the fact that it has positive correlations (by corollary 15 .4p to get 

P(D r - > (£ K (l-e')) m \D Q = (ln£) m ) 

> P(Dp>(£ K (l-e')) m \D o = {0) m ) 

> 1] P(De-{i)>£ K {l-s')\Do = {Qy 



1 < i < m 

By proposition 18.11 and 18.81 for i £ { 1, . . . , m }, 

p(D i2 (i)>l K (l-e')\D o = (0) m ) 

24 

= P{Y e > 4(1 - e') | Yo = 0) > 1 - — . 
Putting together the previous estimates, we obtain 

P(r* > £ 2 , D P > (4(1 - e')) m | Do = Qn£) m ) > 

24\™ „, /21n4 ln£ 
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It remains to study P(t* > k^ 1 -^ \ D Q = (4(1 - e')) m ) . We use the 
inequality of lemma [5. 121 with n = k;H 1_£ ) and b = £ K (1 — e'): 

P(r* < k 1 ^ | D Q = (4(1 - £ ')D < ^ (1 " e) m- B{0) 



6(4(1 -£')) 



Using the large deviation estimates of lemma [721 we see that, for e' small 
enough, there exists c(e) > such that, for £ large enough, 

P(t* < k^ 1 "^ | D Q = (4(1 - e')) m ) < mexp(-c(e)£) . 

Collecting all the previous estimates, we conclude that, for £ large enough, 

p(r>^-> i D „= <!,-)> (i-^)-(i- T'" 

x (l - mexp(-c(e)£)) ((l - - £ 2 m( 

Moreover, by Markov's inequality, 

e(t* I Do = (lH > k £{1 - s} p(t* > n l{1 ~ e) I Do = (1)' 



21nA ln< 



It follows that 

liminf -Me(t* I D = (l) m ) > (l-e)lnK. 

e,m— >ao, q— >0 £ V / 

Letting e go to yields the desired lower bound. □ 



9 Synthesis 

As in theorem 13. 1[ we suppose that 

I —> +oo , m — ^ +oo , g , 

in such a way that 

772 

a e]0,+oo[, — ->• a G [0, +oo] . 

We put now together the estimates of sections [7J and [8] in order to evaluate 
the formula for the invariant measure obtained at the end of section 16.31 
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For 9 = £,l,we had 
l-pm \ m J 



i=l ^n=0 



E E E f&) I A = i)P(o e A0) = * I ol = oQ 



E(t* I Og - c& it ) + e (to I Zl = ^)P(0 6 T , (0) = i | Og = oL it ) 



Using the monotonicity of (Z^)„>o, wc have 



S(r | 4 = 1) < ££(r | Z« = i)P(O?.(0) = i | 0° = f xit ) 



i = l 



< E(t Zl = m 



With the help of proposition 17. 61 we deduce from the previous inequalities 
that 



l im l\ n J2E{T \Z e =i)P{O e T 40) = i\O e = o e cxit ) = V(p*(a),0). 

By proposition 18.101 for a € [0, +oo[, 

lim - In £(r* I = o^i t ) = In k . 

l,m— >oo, q— ¥0 I 

For the case a = +oo, by corollary 18. Ill and proposition 18.91 
limsup - In (-E(t* \ O e = o c e xit )) < In k . 

Iq^a, ^-^oo 

These estimates allow to evaluate the ratio between the discovery time and 
the persistence time. We define a function tp :]0, +oo[— > [0, +oo[ by setting 

Vae]0,+oo[ i>(a) = V(p*(a),0). 

We first prove the properties of the function "J stated before theorem 13.11 
If a > lncr, then p*(a) = and \I>(a) = 0. If a < In cr, then p*(a) > and 
it follows from lemma [731 that ^(a) > 0. Finally, for a > 0, we have that 

V(p*(a),0) < l(e- a ,0) = In— ^- < +oo. 

1 — e~ a 
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For a £ [0, +00 [ or a = +00, we have 

m 

Y / E(r \Z d =i)P(O 8 T ,(0) = i\O e = 



lim 



, ^ ^0 E(t* I O e = * xit ) 

{0 if a ip{ a ) < m K 
+00 if a ip(a) > In k 

Notice that the result is the same for 9 = £ and 9=1. By proposition ^. 121 
we have 



cxity 



EKE/(t)l^ = V p(o '* (0) = i|o ° = o: 

lim „r° = /(,*). 

q 4o, i q ^a £ £(r I Z e = i)P(0«. (0) = i | O e = <& it ) 

i=i 

Putting together the bounds on v given in section 16.31 and the previous 
considerations, we conclude that 

f ., f ifaV(«)<lnK 
lim / / dv = < 

£,m->oo,g->-0 7(0 1] \ f (p* (a)) H a ip(a) > \n K 

This is valid for any continuous non-decreasing function / : [0, 1] — > M such 
that /(0) = 0. 
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